Dopamine receptors regulate preference between high-effort and high-risk rewards

Gabriel, Daniel B. K.; Liley, Anna E.; Freels, Timothy G.; Simon, Nicholas W.

doi:10.1007/s00213-020-05745-z

Dopamine receptors regulate preference between high-effort and high-risk rewards

Original Investigation
Published: 07 January 2021

Volume 238, pages 991–1004, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Psychopharmacology Aims and scope Submit manuscript

Dopamine receptors regulate preference between high-effort and high-risk rewards

Download PDF

Daniel B. K. Gabriel¹,
Anna E. Liley¹,
Timothy G. Freels² &
…
Nicholas W. Simon ORCID: orcid.org/0000-0001-9056-3509¹

992 Accesses
4 Citations
4 Altmetric
Explore all metrics

Abstract

Rationale

Optimal decision-making necessitates evaluation of multiple rewards that are each offset by distinct costs, such as high effort requirement or high risk of failure. The neurotransmitter dopamine is fundamental toward these cost-benefit analyses, and D1-like and D2-like dopamine receptors differently modulate the reward-discounting effects of both effort and risk. However, measuring the role of dopamine in regulating decision-making between options associated with distinct costs exceeds the scope of traditional rodent economic decision-making paradigms.

Objectives

We developed the effort vs probability economic conflict task (EvP) to model multimodal economic decision-making in rats. This task measures choice between two rewards of uniform magnitude associated with either a high effort requirement or risk of reward omission. We then tested the modulatory effects of systemic cocaine and D1/D2 blockade or activation on the preference between high-effort and high-risk alternatives.

Methods

In the EvP, two reinforcers of equal magnitude are associated with either (1) an effort requirement that increases throughout the session (1, 5, 10, and 20 lever presses), or (2) a low probability of reward receipt (25% of probabilistic choices). Critically, the reinforcer for each choice is comparable (one pellet), which eliminates the influence of magnitude discrimination on the decision-making process. After establishing the task, the dopamine transporter blocker cocaine and D1/D2 antagonists and agonists were administered prior to EvP performance.

Results

Preference shifted away from either effortful or probabilistic choice when either option became more costly, and this preference was highly variable between subjects and stable over time. Cocaine, D1 activation, and D2 blockade produced limited, dose-dependent shifts in choice preference contingent on high or low effort conditions. In contrast, D2 activation across multiple doses evoked a robust shift from effortful to risky choice that was evident even when clearly disadvantageous.

Conclusions

The EvP clearly demonstrates that rats can evaluate distinct effortful or risky costs associated with rewards of comparable magnitude, and shift preference away from either option with increasing cost. This preference is more tightly linked to D2 than D1 receptor manipulation, suggesting D2-like receptors as a possible therapeutic target for maladaptive biases toward risk-taking over effort.

Dopamine Antagonism Decreases Willingness to Expend Physical, But Not Cognitive, Effort: A Comparison of Two Rodent Cost/Benefit Decision-Making Tasks

Article 20 October 2014

Striatal dopamine D2 receptors regulate effort but not value-based decision making and alter the dopaminergic encoding of cost

Article 20 July 2018

Rodent Models of Adaptive Value Learning and Decision-Making

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Economic decision-making refers to the complex cost-benefit analyses that guide value-based choices (Rangel et al. 2008; Kalenscher and van Wingerden 2011). During this process, each reinforcer must be integrated with factors that detract from, or “discount”, the subjective economic value of that option (Montague and Berns 2002; Bechara 2005; Rangel et al. 2008). A recurring decision-making scenario involves the conflict between high effort and high risk options (Winstanley and Floresco 2016). For example, appraising methods of paying for food and housing may necessitate choice between steady work for a reliable paycheck vs. a trip to a nearby casino. While each option may have a comparable ideal outcome (sufficient earnings to pay bills), overall value will differ according to evaluation of each option’s discounting factors, in this case, the effort expenditure of a job vs the risk of losing one’s savings at the casino. Comparison between choices involving effortful and probabilistic costs is commonly disrupted in pathologies of decision-making, such as the bias in substance use disorder toward seeking high risk/low effort reinforcement (Brevers et al. 2014; Saddoris et al. 2016; Verdejo-García et al. 2018).

Animal models provide a necessary avenue toward deconstructing the economic decision-making process (Floresco et al. 2008a, b; Orsini et al. 2015; Winstanley and Floresco 2016). Standard paradigms study economic decision-making by measuring choice between a small and large reinforcer associated with a dynamic discounting factor such as delay, risk, or effort (Evenden and Ryan 1996; Cardinal and Howes 2005; Floresco et al. 2008b; Winstanley and Floresco 2016). However, these designs fail to capture situations in which each reinforcer is associated with a distinct cost, which is often the case in real-world decision-making. To this end, we developed the effort vs probability (EvP) economic conflict task to model complex choices between high-effort and high-risk options. In the EvP, two reinforcers of equal magnitude are each associated with a distinct discounting factor: (1) an effort requirement that increases throughout the session (effortful choice), or (2) a low probability of reinforcement (probabilistic choice). The EvP begins with the effortful choice requiring only a single lever press for reinforcement vs the probabilistic choice yielding a 25% chance of reinforcement, rendering the effortful choice the objectively superior option. However, as the task advances, the effortful choice requires progressively more effort (5, 10, and 20 lever presses), whereas the probabilistic choice maintains a single lever press at a fixed delivery probability of 25%. Critically, the reinforcer associated with each choice is comparable in value (one pellet), which eliminates the influence of magnitude discrimination on the decision-making process. We predicted that rats would be able to evaluate both discounting factors to guide decision-making, initially preferring effortful choice, then shifting preference toward the probabilistic option as effort requirements increased.

The neurotransmitter dopamine is particularly critical for mediating value-based decision-making (Salamone et al. 2016; Schultz 2016; Schultz et al. 2017; Burke et al. 2018), and plays a significant role in the evaluation of both effort and probability. Cocaine and amphetamine, drugs of abuse that both enhance dopamine transmission, have contrasting and sometimes complex effects on decision-making. Cocaine reduces willingness to expend high levels of effort, increases tolerance for risk in pursuit of reinforcers, and causes perseverative behavior in decision-making with risk of punishment (van Haaren and Anderson 1994; Verdejo-García et al. 2007; Simon et al. 2009; Peña-Oliver et al. 2014). Conversely, elevated dopamine signaling caused by amphetamine can increase or decrease risk tolerance contingent on the identity of punishment (risk vs reward omission) and the manner in which the cost of uncertainty is manipulated throughout a task (Simon et al. 2009; St. Onge and Floresco 2009; St. Onge et al. 2010; Yang et al. 2018). Amphetamine also exerts dose dependent effects on effortful behavior, with high doses attenuating effort expenditure in pursuit of reinforcers, whereas lower doses or the slow release prodrug lisdexamfetamine increase effort expenditure during easier effort requirements (Floresco et al. 2008b; Yohn et al. 2016).

Manipulation of specific dopamine receptor subtypes has also been shown to influence risky and effortful decision-making. Blockade of D1-like dopamine receptors reduces probabilistic choice, while D1 stimulation increases preference for uncertain reinforcement (St. Onge and Floresco 2009). Similarly, probabilistic choice is reduced by D2 antagonists and increased by D2 agonists (St. Onge and Floresco 2009; Grall-Bronnec et al. 2016). Manipulation of specific receptor subtypes produces similar effects on effortful choice. Effortful choice is reduced by D1 blockade (Nunes et al. 2010; Yohn et al. 2015; Salamone et al. 2016). Conversely, D1 activation increases effortful choice, in addition to deterring the effort aversion elicited by D1 antagonism (Yohn et al. 2015; Soutschek et al. 2020). D2 antagonists also reduce effortful choice (Yohn et al. 2015), though their stimulation does not affect effort expenditure for food (Zhang et al. 2010).

The contribution of both D1- and D2-like receptors to the evaluation of both effort and probability suggests a role for these receptors when both factors are evaluated prior to a decision. To this end, we tested the role of dopaminergic activity during effort vs risk economic conflict. First, we measured behavior in EvP to confirm that subjects were able to accurately discern between effortful and risky options. We then used acute exposure to the psychostimulant cocaine, a dopamine transporter inhibitor that increases synaptic dopamine concentration, to test impact of a general increase in synaptic dopamine on EvP. Finally, we assessed the acute effects of systemic D1- and D2-like dopamine receptor activation or blockade on effortful vs probabilistic choice.

Materials and methods

Subjects

Male Long-Evans rats (n = 23, Envigo Corp) were obtained at approximately three months of age. Subjects arrived pair-housed with ad libitum access to food and water and were kept on a 12-h light/dark cycle beginning with lights off at 7:30 am. One week after arrival, subjects were food restricted to 90% of their free feeding baseline weight to increase motivation in behavioral tasks. Baseline weight was increased in accordance with the growth projection provided by Envigo to allow for growth with age, resulting in approximately 5 g increases per week until 15 weeks of age. Pairs were separated if aggression or food dominance was observed. All protocols were approved by the University of Memphis Institutional Animal Care and Use Committee.

Behavioral apparatus

Behavior and decision-making processes were measured in MedAssociates (FairFax, VA) operant conditioning chambers in soundproof cubicles. Each were equipped with one retractable lever on each side of an illuminable food trough with 0.635 cm recessed photobeams to track entries, pellet dispenser, metal floor grate, and locomotion tracking photobeams. Sugar pellets weighing 45 mg (Bio-Serv, Flemington, NJ) were used as a behavioral reinforcer.

Instrumental shaping

Initial shaping procedures followed previous protocols (Gabriel et al. 2019; Orsini and Simon 2020). To reduce neophobia, rats were placed in their operant chamber for 5 min and had sugar pellets place in their home cages the day prior to shaping. Shaping began with magazine training in which 38 food pellets were delivered at a rate of 1 every 30 ± 10 s to teach subjects to associate the food trough with pellet delivery. Pellet delivery was accompanied by illumination of the food trough, which remained on until subjects collected the pellets. After collecting all food reinforcers, subjects were trained to exert a single lever press for pellet delivery. In a counterbalanced order, each lever was extended by itself until subjects obtained 50 reinforcers, after which the opposite lever was extended until delivering 50 reinforcers.

Finally, rats learned to nose poke into the trough upon simultaneous illumination of the house light and trough. This response extinguished the food trough light and extended a single lever, which rats pressed for reinforcer delivery (1 pellet) and food trough illumination. Levers were presented in pseudorandom order, with neither extended more than twice sequentially, and retracted immediately when pressed. Lights were extinguished 10 s after reinforcer delivery or upon reinforcer collection and an intertrial interval (ITI) of 10 ± 4 s preceded the next trial. Failure to respond within 10 s of food trough illumination or lever extension resulted in the trial being marked an omission, termination of all stimuli, and ITI initiation. Upon achieving 35 presses on each lever in a session, subjects began training in the EvP.

The effort vs probability economic conflict task

The EvP measured decision-making between two reinforcers that were equivalent in magnitude, but associated with distinct costs: either effort or probability (Fig. 1a). The EvP consisted of 4 blocks of 18 trials each. Each trial began with illumination of the food trough, after which a head entry into the trough extinguished the light and evoked extension of either a single lever (forced choice trial) or two levers (free choice trial). A press on either of these levers was associated with receipt of a single sugar pellet as well as a unique discounting factor. One lever was associated with low probability of pellet delivery (25%), and the other a dynamic effort requirement (fixed ratio 1, 5, 10, or 20). The identity of these levers was counterbalanced between subjects and held consistent throughout training. After a single press of the probabilistic lever, both levers were retracted until the next trial. After an effortful choice, the probabilistic lever was retracted, and the effortful lever remained extended until the subject completed the requisite response sequence. Each block began with 8 forced choice trials with a single lever extended to establish changes in action-outcome contingencies throughout the session, followed by 10 free choice (dual lever) trials in which rats chose between the probabilistic and effortful choice levers. After pellet delivery (or after the lever press in probabilistic trials in which no reinforcer was given), trials proceeded to a 10 ± 4 s ITI in which all lights were extinguished, and levers retracted. Trial omissions resulted from failure to either initiate a trial or select a lever within 10 s resulted in immediate procession to the ITI. Failure to complete the effort requirement within 2 min also resulted in the trial being marked as an omission and progressing to the ITI.

Diminishing probability vs fixed effort economic conflict task

The EvP held the probabilistic choice at a constant rate of reinforcer delivery (25%) and increased the effortful choice’s lever press requirement (1, 5, 10, or 20). To confirm behavior was not solely a product of the variable effort/fixed probability design, a subset of rats performed in an inverted task wherein the effortful choice was fixed at 10 lever presses throughout the session, and the rate of delivery for the probabilistic choice decreased throughout the session (100%, 75%, 50%, 25%). Forced and free choice trials were identical to the standard EvP in every respect other than these changes to the discounting factors.

Economic conflict between probability and effort-equivalent delays

Completion of effort requirements caused a delay between the decision and reinforcement equal to the time required to complete the effortful fixed ratio. To confirm that discounting of the high-effort option in EvP was not solely caused by this delay, a subset of subjects performed a modified task wherein the effortful choice was replaced with a single lever press requirement that delivered a pellet after a delay (0 s, 1.6 s, 3.5 s, or 7.5 s). These delays were determined by taking an average of the time in seconds required to complete each effort requirement in the standard EvP across the final 5 days of training.

Pharmacological manipulations

All acute pharmacological dopamine manipulations were performed using an 8-day drug administration protocol (Simon et al. 2011). On days 1, 3, 5, and 7, subjects received an intraperitoneal injection of the appropriate drug solution prior to testing. Dose sequence was counterbalanced across subjects for every drug regimen. Days 2, 4, 6, and 8 consisted only of behavior in the EvP, serving as controls to ensure no carryover effects of drug administration. All drugs were suspended in .9% saline solution.

The dopamine transporter inhibitor cocaine hydrochloride (0, 2, 5, 10 mg/kg, Sigma-Aldrich), D1 agonist SKF81297 (0, .1, .3, 1.0 mg/kg, Cayman Chemicals), and D2 agonist Quinpirole hydrochloride (0, 0.0375, 0.125, 0.25 mg/kg, Sigma-Aldrich) were all administered 5 min prior to behavior in the EvP. The D1 antagonist SCH23390 hydrobromide (0, 0.005, 0.01, 0.03 mg/kg, Sigma-Aldrich) and D2 antagonist Eticlopride hydrochloride (0, 0.01, 0.03, 0.05 mg/kg, Sigma-Aldrich) were administered 20 min prior to behavior. Drug doses and timing of injections were based on previous studies with these drugs in similar decision-making tasks (Simon et al. 2009, 2011; Winstanley et al. 2011). Drug conditions were tested in the following order: Cocaine, SCH23390, Eticlopride, Quinpirole, SKF81297, with a minimum of 1 week between each new drug regimen. After completion of all drug treatments (55.43 ± 7.67 days for two cohorts), baseline behavior was again measured over a 3-day span, and compared to predrug baseline levels to assess long-term stability of EvP decision-making.

Experimental timeline

Subjects were divided into two cohorts and trained in the EvP until preference between effortful and probabilistic choice was stable. Upon reaching stability, each cohort progressed along a distinct experiment series (Table 1). In brief, cohort 1 received D1/D2 antagonists and agonists, with a minimum of 1-week separating administration of each drug. Cohort 1’s behavior concluded with a no-drug retest in EvP to test if choice preference in the EvP was stable. In cohort 2, effects of cocaine on EvP performance were assessed, followed by a no-drug test of EvP stability. Cohort 2 then trained in variations of the EvP to test the effect of manipulating delay in place of effort and making the probabilistic cost variable and effort fixed. Finally, the effects of D2 activation were measured on EvP with variable probability and fixed effort.

Table 1 Order and duration of experiments conducted with experimental cohorts 1 and 2

Full size table

Data analysis

Subjects performed each task until stable over a 3-day period, determined as a lack of interaction between day and task block in a day × block repeated measures ANOVA. Stability was achieved after an average of 16.25 sessions for all three tasks (EvP, fixed effort vs diminishing probability, and fixed probability vs effort-equivalent delays). Behavior was averaged over the stable period to provide a representative baseline of pre-treatment behavior. Decision preference in EvP was quantified using (1) percent choice of the effortful option across all blocks, and (2) area under the curve (AUC) of percent effortful choice for blocks 2–4. AUC was calculated geometrically as the summed area of a 2 trapezoids created by drawing vertical lines from the x-axis to percent choice of the effortful option in blocks 2, 3, and 4 of the task, i.e., AUC = (B2 % effort + B3 % effort)/2 + (B3 % effort + B4 % effort)/2. This approach has previously been utilized for dose-response curves, and these data were found to be highly correlated with approaches used in classical behavioral economic studies (Myerson et al. 2001; Amlung et al. 2015).

Block 1 was omitted in calculating AUC to maintain a focus on conditions of true economic conflict, as this is the only block with no effort vs probability conflict (single lever press for a reinforcer vs a single lever press with a 25% chance of a reinforcer) and an objectively superior choice. AUC was used in both Pearson’s correlations and paired-samples t tests to compare decision-making between tasks. The indifference point in EvP was defined as the combination of effort and probability where choice was most evenly split between the two options. The delay associated with each effort requirement in the EvP was determined by averaging the time in seconds that subjects required to complete each fixed ratio.

Effects of pharmacological manipulation were analyzed using dose × block (4 × 4) repeated measures ANOVAs. Significant main effects were investigated using pairwise comparisons to identify which doses differed from saline using the EMMEANS and Compare subcommands within the IBM® SPSS® Statistics 26 syntax. Significance was computed based on estimated marginal means as predicted for each dose (saline, low, mid, and high) collapsed across all 4 blocks of the EvP. Because these comparisons collapsed each dose across all effort requirements, each significant saline-dose pair was further analyzed with a follow-up dose × block (2 × 4) repeated measures ANOVA to identify under which effort requirement behavior was changed compared to saline. Dose × effort requirement interactions were investigated through separate repeated measures ANOVAs for each effort requirement where dose served as the within-subjects variable and pairwise comparisons were used to identify which doses differed from saline at each effort requirement.

In cases where drug administration led to omissions of entire blocks of a task, missing data were interpolated using slope between adjacent blocks. In sessions where a subject fully omitted at least one block of trials, data from that subject was not included in repeated-measures ANOVA analyses due to the listwise deletion of asymmetrical data. To confirm that interpretation of dopaminergic manipulations was not skewed by subjects excluded from ANOVAs, linear mixed effects models were created in Matlab 2020b (Mathworks; Natick, MA) for the effects of each. Fixed factors were defined as drug concentration, effort requirement, and the dose × effort interaction, subject served as the random factor, and the fit of these models was then compared to a degenerate model consisting of a single constant term. The Greenhouse-Geisser correction is reported for all statistics in which the assumption of sphericity was violated.

Results

Experiment 1: the economic conflict task

The economic conflict task (Fig. 1a) offers rats a choice between two equal-sized pellet rewards, each associated with either a low probability of delivery (25%) or a dynamic effort requirement that increased throughout the session (1, 5, 10, and 20 lever presses). In the first block of the task, when both options were associated with a single lever press, subjects displayed definite preference for “effortful” over probabilistic reinforcement (M = 86.40, SEM = 3.01, Fig. 1b). However, as the effort requirement increased across blocks rats shifted preference (F (1.869, 41.121) = 53.608, p < .001, Fig. 1b), ultimately preferring reinforcers associated with probabilistic over effortful costs (M = 23.27, SEM = 5.93, Fig. 1b). This suggests that rats can compare and distinguish between multiple discounting factors, in this case probability and effort requirement. There was a substantial degree of individual variability between subjects (Fig. 1c), which is commonly observed in other cost-benefit decision-making tasks (Simon et al. 2011; Freels et al. 2020). Importantly, AUC for choice of the effortful over probabilistic option was strongly correlated at the initial predrug baseline period and after culmination of all drug treatments (55.4 ± 7.7 days later, r = 0.856, p < .001, Fig. 1d), indicating that preference for reinforcement associated with effortful vs probabilistic costs was highly stable over long periods of time. Furthermore, baseline effortful vs probabilistic choice was consistent across all drug administration schedules, as indicated by a lack of difference within saline sessions across all schedules (F (3, 27) = 1.718, p = 0.187) and no schedule by effort requirement interaction (F (4.933, 44.398) = 1.129, p = 0.359).

Next, we confirmed that behavior during the economic conflict task was due to true economic conflict and not to one discounting factor remaining constant while another changed. This was accomplished by training a subset of subjects in a modified task measuring preference between a fixed effort requirement of 10 lever presses and reinforcement delivered with decreasing probability throughout the session (100%, 75%, 50%, 25%). As expected, choice in block 1 favored the objectively advantageous option, such that rats preferred the reinforcer associated with 100% “probabilistic” cost over the reinforcer associated with a 10 press effort requirement (M = 82.99, SEM = 7.28, Fig. 2a). Critically, preference shifted toward the effortful option as the probability of reinforcement was reduced (F (3, 33) = 8.582, p < 0.001, Fig. 2a). We estimated the “indifference point” (approximately 50% choice of both options) to be block 3 of the standard EvP (10 lever press vs 25% delivery rate, M = 51.03, SEM = 5.85, Fig. 1b), and determined that preference between options during this block point was comparable between the EvP and the modified economic conflict task (t (11) = 0.387, p = 0.706, Fig. 2b). Furthermore, there were no differences in overall preference for effortful vs probabilistic choice (t (11) = 0.063, p = 0.951) as well as a near significant correlation in AUC between tasks (r (11) = 0.509, p = 0.091, Fig. 2c), suggesting that behavior during economic conflict is consistent regardless of the varying discounting factor.

Selection of the effortful choice causes a delay before reinforcement equal to the time necessary to finish the lever press requirement. It is possible that aversion to this delay, rather than the physical effort requirement, drove the shift from effortful to probabilistic choice. Therefore, we designed a modified EvP in which choice was between a probabilistic (25%) reinforcer vs. reinforcement delivered after a delay that increased throughout the session. Each delay corresponded to the average time subjects required between the first lever press and completion of the effort requirement in the original task. These mean delays were calculated as 0 s, 1.6 s, 3.5 s, and 7.5 s corresponding respectively with the 1, 5, 10, and 20 lever press effort requirements. Rats previously trained in the EvP (Fig. 3a) displayed preference for the immediate over probabilistic reinforcement in the first block of the effort equivalent delay vs fixed probability task (M = 81.39, SEM = 9.11, Fig. 3b), then shifted choice preference away from the delayed toward the probabilistic option as delay preceding reward delivery increased (F (1.578, 17.353) = 4.271, p = 0.039, Fig. 3b). Preference for probabilistic choice, as measured by AUC, was significantly greater in this delay-based task than the standard EvP (t (11) = 5.250, p < 0.001, Fig. 3c), and there was no correlation between the effort and delay versions of the economic conflict task (r (12) = 0.239, p = 0.454). Thus, replacing effort with an equivalent delay substantially changed choice behavior, suggesting that the shift away from rewards with effortful costs was not solely caused by delay. Rather, the influence of effortful and delayed costs is evaluated differently when being compared to probabilistic costs, with rats demonstrating enhanced discounting of effort compared to delay.

Experiment 2: effect of cocaine on economic conflict

After establishing the economic conflict task, we tested the acute effects of multiple dopaminergic drugs on effort vs probability discounting. First, rats were administered cocaine, a highly reinforcing drug that increases synaptic dopamine (as well as norepinephrine and serotonin) via reuptake inhibition. Overall, cocaine significantly shifted preference from probabilistic to effortful reinforcement (F (3, 33) = 3.626, p = 0.023, Fig. 4a). There was also a dose by effort requirement interaction (F (9, 99) = 2.648, p = 0.009, Fig. 4a). Post hoc tests revealed that, compared to saline, preference for effortful over probabilistic choice was increased by the low and mid doses of cocaine but unaffected by the high dose (low: p = 0.017; mid: p = 0.003; high: p = 0.370). Further analyses found that this shift from probabilistic to effortful choice induced by low and mid doses compared to saline was manifested only in the initial blocks (1 and 5 press requirements, p < 0.015), but was not evident in higher effort requirement blocks (10 and 20 press requirements, p > 0.30). Notably, a small group of subjects that displayed relatively low selection (< 60%) of effortful choice in block one with saline, which may have accounted for the effects of cocaine. However, after removing these subjects, the significant effects of cocaine at the low and mid doses in these blocks persisted, with low and mid doses of cocaine increasing effortful choice compared to saline during the 1 press (low: p = 0.007; mid: p = 0.007) and the 5 press conditions (low: p = 0.047; mid: p = 0.007). No omitted trials occurred at any dose, and AUC for conditions involving true economic conflict (blocks 2–4) did not differ with dose (F (3, 33) = 1.104, p = 0.361).

Experiment 3: role of D1 dopamine receptor subtype during economic conflict

While the effects of cocaine suggest a role for dopamine in the economic conflict task, specific dopamine receptor subtypes have distinct and sometimes opposite effects on economic decision-making (Floresco et al. 2008a). To delineate the role of dopamine receptor subtypes during economic conflict between rewards with effortful and probabilistic costs, agonists and antagonists for D1- and D2-like receptors were systemically administered prior to task performance.

Blockade of D1 receptors with D1 antagonist SCH23390 had no effect on preference between effortful and probabilistic reinforcement (F (3, 30) = 0.950, p = 0.429; dose by block interaction: F (3.831, 38.314) = 0.963, p = 0.436, Fig. 4b). Furthermore, D1 blockade did not modulate AUC (F (3, 30) = 1.331, p = 0.283), and did not affect the number of omitted trials in a session (F (1.222, 12.220) = 3.754, p = 0.070). The D1 agonist SKF81297 hydrobromide also had no effect on choice between the effortful and probabilistic options (F (3, 24) = 0.632, p = 0.601, Fig. 4c) or AUC (F (3, 24) = 0.934, p = 0.440) and did not significantly increase omitted trials (F (1.631, 16.312) = 0.684, p = 0.490) although two subjects were excluded from ANOVA analyses due to omissions. However, D1 activation did elicit a block × dose interaction (F (4.029, 32.235) = 2.853, p = 0.039, Fig. 4c). A significant block by dose interaction was also observed using a linear mixed effects model (LMEM), which is less vulnerable to omissions than an individual ANOVA (Yates 2018) (Table 2). Post hoc analyses revealed that D1 activation shifted preference from effortful to probabilistic choices only at the highest effort requirement (F (3, 24) = 3.047, p = 0.048) and only at the highest dose of the D1 agonist compared to saline (p = 0.034). This suggests that high dose D1 agonist increases the preference for probabilistic over effortful costs, but only during relatively high effort requirements. Thus, systemic D1 receptor activation, but not blockade, has selective effects on decision-making during economic conflict.

Table 2 Linear mixed effects model for drugs that induced omissions. All drug schedules in which at least one subject was excluded from ANOVA analyses at any dose due to omitted trials were also analyzed using linear mixed model, which is more effective than standard ANOVA for processing missing data

Full size table

Experiment 4: role of D2 dopamine receptor subtype during economic conflict

The D2 antagonist Eticlopride hydrochloride caused a significant increase in trial omissions (F (1.802, 18.021) = 6.930, p = .007; saline: M = 0.45 ± 0.28; low: M = 0.64 ± 0.54; mid: M = 10.55 ± 4.15; high: M = 17.10 ± 4.24) which was most evident with higher doses of drug. D2 blockade did not alter choice between probabilistic and effortful options (F (3, 15) = 2.173, p = 0.134, Fig. 5a), and there was no dose by effort requirement interaction (F (9, 45) = 1.100, p = 0.382, Fig. 5a). As detailed in the “Methods” section, percent choice of the effortful option during blocks in which rats omitted all trials was interpolated using the slope between completed blocks. However, five subjects required exclusion from the ANOVA due to omissions beyond the scope of interpolating missing data. An LMEM constructed to control for omissions replicated the null effects of D2 blockade on EvP (Table 2), though the effect of dose did trend toward significance (p = 0.095). However, D2 antagonism significantly reduced AUC across blocks 2–4, the blocks in which effort and risk were in conflict (F (3, 24) = 4.640, p = 0.011). Paired-samples t tests were used to compare AUC of each dose to saline, and the shift toward the risky option became more pronounced as dosage increased (low dose D2 antagonist: t (9) = 2.269, p = 0.049; mid: t (9) = 2.666, p = 0.026; high: t (9) = 4.251, p = 0.002).

Activation of D2 receptors via quinpirole elicited a significant shift in preference from those entailing effortful costs to those with probabilistic costs throughout the entire session (F (3, 27) = 21.315, p < 0.001, Fig. 5b), further illustrated by a reduction of AUC (F (3, 27) = 28.329, p < 0.001). This shift was comparable across all doses compared to saline (low: F (1, 10) = 71.328, p < 0.001; mid: F (1, 9) = 100.761, p < 0.001; High: F (1, 10) = 42.067, p < 0.001, Fig. 5b). This shift toward probabilistic from effortful choice was even observed in the first block (1 press vs 25%) during which there is no economic conflict and the “probabilistic” option is objectively disadvantageous (F (3, 27) = 3.640, p = 0.025, Fig. 5b). There was no dose by block interaction (F (9, 81) = 1.516, p = 0.156), and D2 activation dose dependently increased omitted trials (F (3, 30) = 8.802, p < 0.001). One subject was removed from post hoc comparisons between saline and mid dose sessions due to excessive omissions during behavior after mid dose administration; to account for this, we used LMEM to reanalyze the data, which replicated the significant effect of drug treatment and the lack of interaction (Table 2).

Experiment 5: role of D2 activation during modified economic conflict task

In experiment 4, activating D2 receptors markedly reduced preference for options entailing effort in favor of easier options involving probabilistic risk. During the EvP, the effort requirement became increasingly difficult across the session, while probabilistic choice maintained a consistent 25% rate of reinforcement. This difference in contingencies between options raises the concern that D2 activation shifted preference toward probabilistic over effortful costs by increasing preference for stable (probability) over dynamic (effort) parameters. To address this, we trained subjects in the modified task previously used in experiment 1 in which the effort requirement was fixed at 10 presses throughout the session, and the probabilistic choice became increasingly risky (100%, 75%, 50%, 25% chance of reinforcement), then again tested all doses of the D2 agonist. D2 activation again shifted preference toward the probabilistic option away from the effortful one (F (1.836, 16.522) = 4.812, p = 0.025, Fig. 5c), and elicited a dose by effort requirement interaction (F (9, 81) = 2.243, p = 0.027, Fig. 5c). During true economic conflict (blocks 2–4), AUC was also shifted from the effortful to the probabilistic option (F (1.861, 16.751) = 6.604, p = 0.009). Thus, D2 activation during economic conflict selectively increases preference for probabilistic over effortful costs rather than causing a general preference for consistent costs. As in experiment 4, D2 activation caused a dose-dependent increase in omissions (F (1.921, 21.135) = 5.960, p = 0.009), although they remained low even at the highest dose (M = 2.94, SEM = .85), and 2 subjects were removed from ANOVA analyses due to excessive omissions. The LMEM constructed to control for excluded subjects did not produce a dose effect, though a trend toward dose × effort interaction was identified (Table 2).

Discussion

Conflicts between high effort and high-risk reward seeking (such as a steady job vs. playing the lottery) are a common occurrence in everyday decision-making. We developed the effort vs probability economic conflict task (EvP) to measure choice between rewards of comparable magnitude associated with distinct costs, either physical effort or risk of omission. We observed that rats demonstrate flexible decision-making that shifts toward the other option when either effort or risk is increased, and that intrinsic preference for high effort vs high risk options remains stable over long periods of time. Increasing synaptic dopamine with the psychostimulant cocaine shifted preference from probabilistic to effortful choice, but this was limited to conditions with low effort requirements. D1 dopamine receptors played a limited role in EvP, with blockade eliciting no effect, and D1 agonist dose-dependently increasing probabilistic choice, though this effect was limited to trials with a high effort requirement. D2 receptor blockade shifted preferences away from effort and toward probabilistic costs, but also caused a substantial increase in omissions. D2 activation produced a robust shift in choice of risky over effortful costs across all doses and effort requirements.

Validation of EvP task

Rats were able to detect changes in effortful and probabilistic costs and adjust behavior accordingly, shifting choice toward probabilistic costs as effortful costs became increasingly difficult. This provided a novel translational assessment of complex, real-world analogous decision-making in which all available options include some measure of either effort or risk. Furthermore, most cost-benefit decision-making tasks involve choice between rewards of different magnitude (Wade et al. 2000; Floresco et al. 2008a; Salamone et al. 2016; Orsini et al. 2018), whereas choice in the EvP is unaffected by potential differences in reward magnitude discrimination or preference, as the outcome of all rewarded trials is identical (one pellet). This enables detection of effects of experimental manipulation that are less vulnerable to gross motivational factors.

In the EvP, rewards associated with the probabilistic choice were delivered at a fixed rate of 25% of trials, and the effort requirement increased throughout the session. To confirm that patterns of decision-making were not driven by which factor was fixed/dynamic, subjects trained in an inverted version of the task in which effort-requirement was fixed at 10 lever presses and probability of reinforcement decreased throughout the session. Subjects shifted preference from the probabilistic toward the effortful option as probability decreased, thus demonstrating the propensity to shift away from less valuable options regardless of which variable is dynamic. Moreover, subjects’ decision-making was unchanged by this reversal of effort and probability parameters, demonstrating comparable preference between effortful vs probabilistic costs regardless of which factor was fixed, as well as similar performance at the determined “indifference point”, defined as the block in EvP with the least preference for one option over the other (10 lever presses effort vs 25% probability). This indicates that decision-making behavior in the EvP reflects intrinsic preference between effortful and probabilistic costs that is not strongly confounded by task design.

Choice of the effortful option required time for the subjects to complete all lever presses demanded by the effort requirement, and this time increased throughout the session as effort requirement increased. Therefore, it was difficult to discern if subjects were directly weighing effort vs probability or devaluing the effort as a function of the delay required to complete multiple lever presses (i.e., delay discounting). To address this, subjects trained in another modified EvP which substituted delay for effort. Rats shifted preference toward the probabilistic choice as delay increased; however, choice preference differed from the standard EvP, with rats demonstrating greater discounting of effortful than delayed options. This confirmed that preference for probabilistic over effortful costs in EvP was not solely accounted for by delay discounting, while also revealing that effort is tabulated as a greater cost than delay during comparison with uncertainty.

Finally, we observed that preference between effort and probabilistic costs was highly consistent across 2–3 months, suggesting that preference between effort and risk is a stable behavioral phenotype that is unchanged by the passage of time or transient alterations in neurochemistry. Subjects in each cohort received drug schedules in the same order (although doses were counterbalanced within each schedule); thus, it is possible that there were carryover effects that influenced subsequent drug treatments. However, EvP choice preference between baseline saline sessions was consistent between drug protocols, suggesting that the earlier drug schedules did not exert enduring influences on decision-making. Furthermore, the drastic shift from effortful to probabilistic choice induced by D2 receptor activation was observed in two distinct cohorts that had different drug histories, suggesting that prior exposure to a specific regimen of drugs did not drive this effect. This consistency over time suggests that EvP provides a reliable measurement of individual differences in choice during economic conflict, with some subjects potentially capturing the suboptimal preference for high risk over high effort options observed in many psychiatric populations (Brevers et al. 2014; Saddoris et al. 2016; Verdejo-García et al. 2018).

Effects of cocaine on EvP

The dopamine system regulates value-based decision-making (Killcross et al. 1997; Stopper et al. 2014). Psychostimulants such as cocaine increase mesolimbic dopamine activity via inhibition of the dopamine transporter (Mcelvain and Schenk 1992; Allain et al. 2015). We tested the acute effects of cocaine on behavior in the EvP to measure the effects of a dopaminergic drug with strong abuse potential on preference between effort and probability. The effects of cocaine were highly selective: only low and medium (but not high) dose cocaine affected decision-making, manifesting as a shift in preference from probabilistic to effortful options that was only evident with low effort requirements (1 and 5 LP). This subtle shift in preference from probabilistic toward low effort costs was somewhat surprising, as cocaine has been shown to reduce effort expenditure in high fixed ratio tasks (van Haaren and Anderson 1994), and acute cocaine shifts preference toward risky options in a probabilistic discounting task (Mai et al. 2015). However, it is likely that the effects of cocaine (and possibly other psychostimulants) differ between situations with a single discounting factor (effort or probability) vs. situations with multiple factors in conflict. This dissociation suggests that choosing between effortful and probabilistic options may recruit different neuronal circuitry than decision-making requiring evaluation of only one of these factors.

Interestingly, despite increasing synaptic dopamine activity, effects of cocaine contrast with the shift from probabilistic to effortful options produced by both D1 and D2 activation. Cocaine elevates synaptic norepinephrine and serotonin as well as dopamine (Pierce and Kalivas 1997). Therefore, cocaine effects on EvP may be influenced by these other monoamines, or by interactions between dopamine and these other neurotransmitter systems. Indeed, the noradrenergic system is implicated in the integration of effort with reward (Varazzani et al. 2015), and simultaneous blockade of either dopamine and norepinephrine transporters or norepinephrine and serotonin transporters has been shown to alter decision-making in risky decision-making tasks involving probabilistic reward (Baarendse et al. 2013; Cui et al. 2018). It will be of interest for future studies to test the acute effects of amphetamine on EvP, as amphetamine influences both effortful and risky decision-making and has less impact on serotonin transmission than cocaine (Pierce and Kalivas 1997; Andrews and Lucki 2001; Floresco et al. 2008b; St. Onge and Floresco 2009).

Parsing the roles of dopamine receptor subtypes in EvP

The D1- and D2-like dopamine receptor subtypes have dissociable roles in both effortful and probabilistic choices (Floresco et al. 2008b; St. Onge and Floresco 2009; Orsini et al. 2015; Grall-Bronnec et al. 2016; Salamone et al. 2016). However, the role of specific dopamine receptor subtypes when effortful and probabilistic options are in conflict is undetermined. To address this, we tested the effects of D1- and D2-like agonists and antagonists on choice behavior in the EvP.

D1-like receptor modulation

Previous studies have reported that preference for both probabilistic and effortful reinforcement is reduced by D1 antagonists (St. Onge and Floresco 2009; Yohn et al. 2015). However, when risk and effort costs are weighed against each other in EvP, D1 blockade did not affect decision-making. It is possible that D1 blockade simultaneously increases the discounting of both risky and effortful costs in EvP, thus eliciting no behavioral change in either direction. This also provides further evidence that effort discounting in EvP is evaluated differently than delay discounting, as D1 blockade increased preference for delayed over immediate rewards (Koffarnus et al. 2011; Tian et al. 2019), but did not influence choice in EvP.

D1 activation biased choice toward high risk options and away from effortful ones, though this was selective for trials with the most difficult effort requirement (20 LP) and was only evident at the highest dose (0.3 mg/kg). Previous studies have shown that D1 activation increases both effort and probabilistic choice in separate effort and probabilistic discounting tasks (St. Onge and Floresco 2009; Nunes et al. 2010; Yohn et al. 2015). Rather than these preferences canceling out (as with the D1 antagonist), the D1 agonist causes a selective shift toward risk, suggesting that increased preference for risk is more tightly linked to D1 receptors than preference for effort. Interestingly, high dose systemic D1 agonists in humans cause decreased preference for larger, risky rewards and increased effort expenditure for rewards (Soutschek et al. 2020), which contrasts with the results here. This is likely related to task differences (the human tasks did not involve choice between effortful and probabilistic costs in the same session) or differences in decision-making processes between rodents and humans. In sum, the effects of D1 receptor manipulation on behavior in the EvP suggest that sensitivity to effort may be more tightly linked to D1 receptors than sensitivity to risk in rat models.

D2-like receptor modulation

D2 receptor blockade shifted behavior away from the effortful choice toward the probabilistic option. This is consistent with a report that D2 antagonists reduce overall effortful decision-making (Yohn et al. 2015), though it contrasts with evoked reductions in risky choice (St. Onge and Floresco 2009; Grall-Bronnec et al. 2016). Thus, D2 blockade’s reduction in effortful choice supersedes reductions in risk preference. These effects were dose dependent, with higher doses causing an increased shift away from the effortful choice/toward the risky choice. Critically, the observed effect of D2 blockade in the EvP should be interpreted with caution due to the drastic increase in omitted trials, especially at the higher doses.

The most notable change in decision-making during EvP was elicited by D2 activation, which shifted preference away from effort toward probabilistic costs across all conditions of effort requirement across all doses. This suggests that decision-making in the face of conflicting effortful and probabilistic choices may rely heavily on activation of D2 dopamine receptors. While the effects of systemic D2 agonists on effortful choice have not been clearly identified, stimulating D2 receptors in both the shell and core subregions of the nucleus accumbens reduces effortful choice (Bryce and Floresco 2019). Additionally, D2 receptor activation is closely tied to gambling and preference for uncertain reinforcement in both humans and rats (St. Onge and Floresco 2009; Cocker et al. 2016; Grall-Bronnec et al. 2016; Tremblay et al. 2017). Therefore, it is likely that D2 activation produces a twofold effect on decision-making in EvP, making the explicit cost associated with effortful choice more aversive and reducing the cost of possible reward omission. This summation of effects may explain the extremely robust, dose-independent risk preference evoked by the D2 agonist in EvP.

The D2 agonist-evoked shift from effortful to probabilistic choice persisted in the modified EvP, in which effort requirement was fixed, and the probabilistic cost became increasingly risky. This confirmed that D2 activation makes uncertain/risky choice extremely compelling regardless of whether effort or probability remained fixed throughout the session. The D2 activation–evoked preference for risky choice may arise from summative effects on the discounting of both effortful and risky reinforcement.

Interestingly, D2 activation elicited probabilistic preference in the initial block of EvP, during which the probabilistic option was objectively inferior to the effortful option. In this block, the effortful lever press requirement was only a single action, but the probabilistic option was only reinforced on 25% of trials. Regardless, D2 activation induced a significant shift toward this disadvantageous option, suggesting that D2 activation increases the value of uncertain reinforcement independent of effort level.

Another potential explanation for the observed shift from effortful to probabilistic choice is the role of presynaptic D2 autoreceptors in reducing phasic dopamine release (Ford 2014) and motivation for food rewards (Bello et al. 2012). By attenuating phasic release, D2 autoreceptor activation consequentially reduces postsynaptic D1 activation, which attenuates motivated behavior (Depoortere et al. 1993; Wall et al. 2011; Sugam et al. 2014; Saddoris et al. 2015). This suggests that the shift away from effortful choices may arise from reduced overall motivation to perform instrumental responding due to secondary effects on D1 receptors. This is an unlikely explanation, however, as direct D1 blockade did not affect choice in the EvP. Furthermore, reduced instrumental motivation does not account for the shift from effortful to probabilistic options when effort requirements were identical. It is therefore likely that D2 receptor activation drastically shifts preference from effortful to probabilistic choice by making uncertain rewards highly compelling.

While dose sequence was counterbalanced within each drug schedule, the order of different drug treatments was the same for all subjects in a cohort (Table 1). It is thus possible that previous drug exposure influenced the effect of subsequent manipulations, or that subtle baseline shifts accounted for drug effects. However, it is unlikely that carryover effects accounted for the observed changes in EvP, as drastic D2 agonist-induced shifts from effortful to probabilistic choice preference were observed in both cohorts despite having been previously exposed to different drugs and different lengths of training. In addition, there were no statistical differences between saline administration sessions for all dopamine-specific drug schedules.

Notably, both blockade and activation of D2 receptors evoked comparable effects on decision-making, with any disruption of baseline D2 receptor transmission causing a shift toward risky reinforcement and away from effortful reinforcement. This contrasts with risky decision-making tasks that do not include an effort component, in which D2 activation increases risky choice while D2 antagonism reduces risky choice (St. Onge and Floresco 2009). Thus, adding an effort component to a risky decision-making task appears to alter effects of dopamine receptor manipulation, manifested as a nonlinear relationship between decision-making and D2 receptor blockade/activation. This is consistent with other aspects of cognition that are regulated in nonlinear fashion by dopamine transmission (Robbins 2005), including punishment-based risky decision-making, wherein either high or low levels of D2 receptor abundance in medial prefrontal cortex is associated with inflexible decision-making (Simon et al. 2011).

Conclusions and implications

To our knowledge, this study marks the first instance in which decision-making between two equivalent rewards associated with either high effort or high risk was assessed in rodents. Rats demonstrated the ability to compare risk and effort, then adapt choice in congruence with changing contingencies. The EvP extends insight from tasks involving a single discounting factor (Evenden and Ryan 1996; Cardinal and Howes 2005; Floresco et al. 2008b; Winstanley and Floresco 2016) to more complex situations, demonstrating that, in some cases, the presence of multiple discounting factors modifies how dopaminergic drugs influence decision-making. Critically, this study suggests that dopamine manipulation, particularly via D2-like receptors, may serve as a potential therapeutic avenue for correcting pathologically disordered preference for high risk over high effort outcomes seen in substance use disorder, ADHD, and gambling disorder (Green and Myerson 2004; Crowley et al. 2006; Floresco et al. 2008a; Cocker et al. 2016; Koffarnus and Kaplan 2018).

References

Allain F, Minogianis EA, Roberts DCS, Samaha AN (2015) How fast and how often: the pharmacokinetics of drug use are decisive in addiction. Neurosci Biobehav Rev 56:166–179. https://doi.org/10.1016/j.neubiorev.2015.06.012
Article PubMed Google Scholar
Amlung M, Yurasek A, McCarty KN et al (2015) Area under the curve as a novel metric of behavioral economic demand for alcohol. Exp Clin Psychopharmacol 23:168–175. https://doi.org/10.1037/pha0000014.Area
Article PubMed PubMed Central Google Scholar
Andrews CM, Lucki I (2001) Effects of cocaine on extracellular dopamine and serotonin levels in the nucleus accumbens. Psychopharmacology 155:221–229. https://doi.org/10.1007/s002130100704
Article CAS PubMed Google Scholar
Baarendse PJJ, Winstanley CA, Vanderschuren LJMJ (2013) Simultaneous blockade of dopamine and noradrenaline reuptake promotes disadvantageous decision making in a rat gambling task. Psychopharmacology (Berlin) 225:719–731. https://doi.org/10.1007/s00213-012-2857-z
Article CAS Google Scholar
Bechara A (2005) Decision making, impulse control and loss of willpower to resist drugs: a neurocognitive perspective. Nat Neurosci 8:1458–1463. https://doi.org/10.1038/nn1584
Article CAS PubMed Google Scholar
Bello EP, Mateo Y, Gelman DM et al (2012) Cocaine supersensitivity and enhanced motivation for reward in mice lacking dopamine D2 autoreceptors. Nat Neurosci 14:1033–1038. https://doi.org/10.1038/jid.2014.371
Article CAS Google Scholar
Brevers D, Bechara A, Cleeremans A, Kornreich C, Verbanck P, Noël X (2014) Impaired decision-making under risk in individuals with alcohol dependence. Alcohol Clin Exp Res 38:1924–1931. https://doi.org/10.1111/acer.12447
Article PubMed PubMed Central Google Scholar
Bryce CA, Floresco SB (2019) Alterations in effort-related decision-making induced by stimulation of dopamine D1, D2, D3, and corticotropin-releasing factor receptors in nucleus accumbens subregions. Psychopharmacology 236:2699–2712. https://doi.org/10.1007/s00213-019-05244-w
Article CAS PubMed Google Scholar
Burke CJ, Soutschek A, Weber S, Raja Beharelle A, Fehr E, Haker H, Tobler PN (2018) Dopamine receptor-specific contributions to the computation of value. Neuropsychopharmacology 43:1415–1424. https://doi.org/10.1038/npp.2017.302
Article CAS PubMed PubMed Central Google Scholar
Cardinal RN, Howes NJ (2005) Effects of lesions of the nucleus accumbens core on choice between small certain rewards and large uncertain rewards in rats. BMC Neurosci 6:1–19. https://doi.org/10.1186/1471-2202-6-37
Article Google Scholar
Cocker PJ, Tremblay M, Kaur S, Winstanley CA (2016) Chronic administration of the dopamine D 2 / 3 agonist ropinirole invigorates performance of a rodent slot machine task, potentially indicative of less distractible or compulsive-like gambling behaviour. Psychopharmacology 56:159–166. https://doi.org/10.1007/s00213-016-4447-y
Article CAS Google Scholar
Crowley TJ, Raymond KM, Mikulich-Gilbertson SK, Thompson LL, Lejuez CW (2006) A risk-taking “set” in a novel task among adolescents with serious conduct and substance problems. J Am Acad Child Adolesc Psychiatry 45:175–183. https://doi.org/10.1097/01.chi.0000188893.60551.31
Article PubMed Google Scholar
Cui R, Wang L, Liu L, Ruan H, Li X (2018) Effects of noradrenergic and serotonergic systems on risk-based decision-making and center arena activity in open field in rats. Eur J Pharmacol 841:57–66. https://doi.org/10.1016/j.ejphar.2018.09.026
Article CAS PubMed Google Scholar
Depoortere RY, Li DH, Lane JD, Emmett-Oglesby MW (1993) Parameters of self-administration of cocaine in rats under a progressive-ratio schedule. Pharmacol Biochem Behav 45:539–548. https://doi.org/10.1016/0091-3057(93)90503-L
Article CAS PubMed Google Scholar
Evenden JL, Ryan CN (1996) The pharmacology of impulsive behavior in rats of drugs on response choice with variyng delays of reinforcement. Psychopharmacology 128:161–170
Article CAS Google Scholar
Floresco SB, St. Onge JR, Ghods-Sharifi S, Winstanley CA (2008a) Cortico-limbic-striatal circuits subserving different forms of cost-benefit decision making. Cogn Affect Behav Neurosci 8:375–389. https://doi.org/10.3758/CABN.8.4.375
Article PubMed Google Scholar
Floresco SB, Tse MTLL, Ghods-Sharifi S (2008b) Dopaminergic and glutamatergic regulation of effort- and delay-based decision making. Neuropsychopharmacology 33:1966–1979. https://doi.org/10.1038/sj.npp.1301565
Article CAS PubMed Google Scholar
Ford CP (2014) The Role of D2-Autoreceptors in regulating dopamine neuron activity and transmission. Neuroscience 282:13–22. https://doi.org/10.1016/j.neuroscience.2014.01.025.The
Article CAS PubMed PubMed Central Google Scholar
Freels TG, Gabriel DBK, Lester DB, Simon NW (2020) Risky decision-making predicts dopamine release dynamics in nucleus accumbens shell. Neuropsychopharmacology 45:266–275. https://doi.org/10.1101/572263
Article CAS PubMed Google Scholar
Gabriel DBK, Freels TG, Setlow B, Simon NW (2019) Risky decision-making is associated with impulsive action and sensitivity to first-time nicotine exposure. Behav Brain Res 359:579–288. https://doi.org/10.1016/j.bbr.2018.10.008
Article PubMed Google Scholar
Grall-Bronnec M, Sauvaget A, Perrouin F, Leboucher J, Etcheverrigaray F, Challet-Bouju G, Gaboriau L, Derkinderen P, Jolliet P, Victorri-Vigneau C (2016) Pathological gambling associated with aripiprazole or dopamine replacement therapy. J Clin Psychopharmacol 36:63–70. https://doi.org/10.1097/JCP.0000000000000444
Article CAS PubMed Google Scholar
Green L, Myerson J (2004) A discounting framework for choice with delayed and probabilistic rewards. Psychol Bull 130:769–792. https://doi.org/10.1037/0033-2909.130.5.769
Article PubMed PubMed Central Google Scholar
Kalenscher T, van Wingerden M (2011) Why we should use animals to study economic decision making - a perspective. Front Neurosci 5:1–11. https://doi.org/10.3389/fnins.2011.00082
Article Google Scholar
Killcross AS, Everitt BJ, Robbins TW (1997) Symmetrical effects of amphetamine and alpha-flupenthixol on conditioned punishment and conditioned reinforcement: contrasts with midazolam. Psychopharmacology 129:141–152. https://doi.org/10.1007/s002130050174
Article CAS PubMed Google Scholar
Koffarnus MN, Kaplan BA (2018) Clinical models of decision making in addiction. Pharmacol Biochem Behav 164:71–83. https://doi.org/10.1016/j.pbb.2017.08.010
Article CAS PubMed Google Scholar
Koffarnus MN, Newman AH, Grundt P, Rice KC, Woods JH (2011) Effects of selective dopaminergic compounds on a delay discounting task. Behav Pharmacol 22:300–311. https://doi.org/10.1146/annurev.neuro.31.060407.125627.Brain
Article CAS PubMed PubMed Central Google Scholar
Mai B, Sommer S, Hauber W (2015) Dopamine D1/D2 receptor activity in the nucleus accumbens core but not in the nucleus accumbens shell and orbitofrontal cortex modulates risk-based decision making. Int J Neuropsychopharmacol 18:1–9. https://doi.org/10.1093/ijnp/pyv043
Article CAS Google Scholar
Mcelvain JS, Schenk JO (1992) A multisubstrate mechanism of striatal dopamine uptake and its inhibition by cocaine. Biochem Pharmacol 43:2189–2199. https://doi.org/10.1016/0006-2952(92)90178-L
Article CAS PubMed Google Scholar
Montague PR, Berns GS (2002) Neural economics and the biological substrates of valuation. Neuron 36:265–284. https://doi.org/10.1016/S0896-6273(02)00974-1
Article CAS PubMed Google Scholar
Myerson J, Green L, Warusawitharana M (2001) Area under the curve as a measure of discounting. J Exp Anal Behav 76:235–243. https://doi.org/10.1901/jeab.2001.76-235
Article CAS PubMed PubMed Central Google Scholar
Nunes EJ, Randall PA, Santerre JL, Given AB, Sager TN, Correa M, Salamone JD (2010) Differential effects of selective adenosine antagonists on the effort-related impairments induced by dopamine D1 and D2 antagonism. Neuroscience 170:268–280. https://doi.org/10.1016/j.neuroscience.2010.05.068
Article CAS PubMed PubMed Central Google Scholar
Orsini CA, Simon NW (2020) Reward/punishment-based decision making in rodents. Curr Protoc Neurosci 93:1–21. https://doi.org/10.1002/cpns.100
Article Google Scholar
Orsini CA, Moorman DE, Young JW, Setlow B, Floresco SB (2015) Neural mechanisms regulating different forms of risk-related decision-making: Insights from animal models. Neurosci Biobehav Rev 58:147–167. https://doi.org/10.1016/j.neubiorev.2015.04.009
Article PubMed PubMed Central Google Scholar
Orsini CA, Hernandez CM, Bizon JL, Setlow B (2018) Deconstructing value-based decision making via temporally selective manipulation of neural activity: insights from rodent models. Cogn Affect Behav Neurosci 19:459–476. https://doi.org/10.3758/s13415-018-00649-0
Article Google Scholar
Peña-Oliver Y, Sanchez-Roige S, Stephens DN, Ripley TL (2014) Alpha-synuclein deletion decreases motor impulsivity but does not affect risky decision making in a mouse gambling task. Psychopharmacology 231:2493–2506. https://doi.org/10.1007/s00213-013-3416-y
Article CAS PubMed Google Scholar
Pierce RC, Kalivas PW (1997) A circuitry model of the expression of behavioral sensitization to amphetamine-like psychostimulants. Brain Res Rev 25:192–216. https://doi.org/10.1016/S0165-0173(97)00021-0
Article CAS PubMed Google Scholar
Rangel A, Camerer C, Montague PR (2008) A framework for studying the neurobiology of value-based decision making. Nat Rev Neurosci 9:545–556. https://doi.org/10.1038/nrn2357
Article CAS PubMed PubMed Central Google Scholar
Robbins TW (2005) Chemistry of the mind: neurochemical modulation of prefrontal cortical function. J Comp Neurol 493:140–146. https://doi.org/10.1002/cne.20717
Article CAS PubMed Google Scholar
Saddoris MP, Sugam JA, Stuber GD, Witten IB, Deisseroth K, Carelli RM (2015) Mesolimbic dopamine dynamically tracks, and is causally linked to, discrete aspects of value-based decision making. Biol Psychiatry 77:903–911. https://doi.org/10.1016/j.biopsych.2014.10.024
Article PubMed Google Scholar
Saddoris MP, Wang X, Sugam JA, Carelli RM (2016) Cocaine self-administration experience induces pathological phasic accumbens dopamine signals and abnormal incentive behaviors in drug-abstinent rats. J Neurosci 36:235–250. https://doi.org/10.1523/JNEUROSCI.3468-15.2016
Article CAS PubMed PubMed Central Google Scholar
Salamone JD, Correa M, Yohn SE, Lopez Cruz L, San Miguel N, Alatorre L (2016) The pharmacology of effort-related choice behavior: dopamine, depression, and individual differences. Behav Process 127:3–17. https://doi.org/10.1016/j.beproc.2016.02.008
Article Google Scholar
Schultz W (2016) Dopamine reward prediction-error signalling: a two-component response. Nat Rev Neurosci 17:1079–1084. https://doi.org/10.1038/nrn.2015.26.Dopamine
Article Google Scholar
Schultz W, Stauffer WR, Lak A (2017) The phasic dopamine signal maturing: from reward via behavioural activation to formal economic utility. Curr Opin Neurobiol 43:139–148. https://doi.org/10.1016/j.conb.2017.03.013
Article CAS PubMed Google Scholar
Simon NW, Gilbert RJ, Mayse JD, Bizon JL, Setlow B (2009) Balancing risk and reward: a rat model of risky decision making. Neuropsychopharmacology 34:2208–2217. https://doi.org/10.1038/npp.2009.48
Article PubMed PubMed Central Google Scholar
Simon NW, Montgomery KS, Beas BS, Mitchell MR, LaSarge CL, Mendez IA, Banuelos C, Vokes CM, Taylor AB, Haberman RP, Bizon JL, Setlow B (2011) Dopaminergic modulation of risky decision-making. J Neurosci 31:17460–17470. https://doi.org/10.1523/JNEUROSCI.3772-11.2011
Article CAS PubMed PubMed Central Google Scholar
Soutschek A, Gvozdanovic G, Kozak R, Duvvuri S, de Martinis N, Harel B, Gray DL, Fehr E, Jetter A, Tobler PN (2020) Dopaminergic D1 receptor stimulation affects effort and risk preferences. Biol Psychiatry 87:1–8. https://doi.org/10.1016/j.biopsych.2019.09.002
Article CAS Google Scholar
St. Onge JR, Floresco SB (2009) Dopaminergic modulation of risk-based decision making. Neuropsychopharmacology 34:681–697. https://doi.org/10.1038/npp.2008.121
Article CAS PubMed Google Scholar
St. Onge JR, Chiu YC, Floresco SB (2010) Differential effects of dopaminergic manipulations on risky choice. Psychopharmacology 211:209–221. https://doi.org/10.1007/s00213-010-1883-y
Article CAS PubMed Google Scholar
Stopper CM, Tse MTL, Montes DR, Wiedman CR, Floresco SB (2014) Overriding phasic dopamine signals redirects action selection during risk/reward decision making. Neuron 84:177–189. https://doi.org/10.1016/j.neuron.2014.08.033
Article CAS PubMed Google Scholar
Sugam JA, Saddoris MP, Carelli RM (2014) Nucleus accumbens neurons track behavioral preferences and reward outcomes during risky decision making. Biol Psychiatry 75:807–816. https://doi.org/10.1016/j.biopsych.2013.09.010
Article PubMed Google Scholar
Tian L, Liu X, Mei X, Cui R, Li X (2019) The role of dopamine D1- and D2-like receptors related to muscarinic M1 receptors in impulsive choice in high-impulsive and low-impulsive rats. Pharmacol Biochem Behav 176:43–52. https://doi.org/10.1016/j.pbb.2018.11.005
Article CAS PubMed Google Scholar
Tremblay M, Silveira MM, Kaur S, Hosking JG, Adams WK, Baunez C, Winstanley CA (2017) Chronic D 2/3 agonist ropinirole treatment increases preference for uncertainty in rats regardless of baseline choice patterns. Eur J Neurosci 45:159–166. https://doi.org/10.1111/ejn.13332
Article PubMed Google Scholar
van Haaren F, Anderson K (1994) Effects of cocaine on fixed-interval behavior and schedule-induced alcohol consumption in male and female rats. Pharmacol Biochem Behav 47:997–1002. https://doi.org/10.1016/0091-3057(94)90311-5
Article PubMed Google Scholar
Varazzani C, San-Galli A, Gilardeau S, Bouret S (2015) Noradrenaline and dopamine neurons in the reward/effort trade-off: a direct electrophysiological comparison in behaving monkeys. J Neurosci 35:7866–7877. https://doi.org/10.1523/JNEUROSCI.0454-15.2015
Article CAS PubMed PubMed Central Google Scholar
Verdejo-García AJ, Perales JC, Pérez-García M (2007) Cognitive impulsivity in cocaine and heroin polysubstance abusers. Addict Behav 32:950–966. https://doi.org/10.1016/j.addbeh.2006.06.032
Article PubMed Google Scholar
Verdejo-García A, Chong TT-J, Stout JC et al (2018) Stages of dysfunctional decision-making in addiction. Pharmacol Biochem Behav 164:99–105. https://doi.org/10.1016/j.pbb.2017.02.003
Article CAS PubMed Google Scholar
Wade TR, De Wit H, Richards JB (2000) Effects of dopaminergic drugs on delayed reward as a measure of impulsive behavior in rats. Psychopharmacology 150:90–101. https://doi.org/10.1007/s002130000402
Article CAS PubMed Google Scholar
Wall VZ, Parker JG, Fadok JP, Darvas M, Zweifel L, Palmiter RD (2011) A behavioral genetics approach to understanding D1 receptor involvement in phasic dopamine signaling. Mol Cell Neurosci 46:21–31. https://doi.org/10.1016/j.mcn.2010.09.011
Article CAS PubMed Google Scholar
Winstanley CA, Floresco SB (2016) Deciphering decision making: variation in animal models of effort- and uncertainty-based choice reveals distinct neural circuitries underlying core cognitive processes. J Neurosci 36:12069–12079. https://doi.org/10.1523/JNEUROSCI.1713-16.2016
Article CAS PubMed PubMed Central Google Scholar
Winstanley CA, Cocker PJ, Rogers RD (2011) Dopamine modulates reward expectancy during performance of a slot machine task in rats: Evidence for a near-miss effect. Neuropsychopharmacology 36:913–925. https://doi.org/10.1038/npp.2010.230
Article CAS PubMed PubMed Central Google Scholar
Yang JH, Cheng CP, Liao RM (2018) Effects of d-amphetamine on risk choice in rats depend on the manner in which the expected reward value is varied. Pharmacol Biochem Behav 171:20–29. https://doi.org/10.1016/j.pbb.2018.05.008
Article CAS PubMed Google Scholar
Yates JR (2018) Dissecting drug effects in preclinical models of impulsive choice: emphasis on glutamatergic compounds. Psychopharmacol 235:607–626. https://doi.org/10.1007/s00213-017-4825-0.Dissecting
Article CAS Google Scholar
Yohn SE, Santerre JL, Nunes EJ, Kozak R, Podurgiel SJ, Correa M, Salamone JD (2015) The role of dopamine D1 receptor transmission in effort-related choice behavior: effects of D1 agonists. Pharmacol Biochem Behav 135:217–226. https://doi.org/10.1016/j.pbb.2015.05.003
Article CAS PubMed Google Scholar
Yohn SE, Lopez-Cruz L, Hutson PH, Correa M, Salamone JD (2016) Effects of lisdexamfetamine and s-citalopram, alone and in combination, on effort-related choice behavior in the rat. Psychopharmacology 233:949–960. https://doi.org/10.1007/s00213-015-4176-7
Article CAS PubMed Google Scholar
Zhang D, Wang X, Xiang X, Chen H, Zhang J, Su Q, Hao W (2010) The dopamine D 2 partial agonist and antagonist terguride decreases heroin self-administration on fixed- and progressive-ratio schedules. Pharmacol Biochem Behav 97:222–226. https://doi.org/10.1016/j.pbb.2010.08.002
Article CAS PubMed Google Scholar

Download references

Funding

This work was supported by the National Institute on Drug Abuse (R15DA046797), a Young Investigator Award from the Brain and Behavior Research Foundation, and a Faculty Research Grant from the University of Memphis (NWS).

Author information

Authors and Affiliations

Department of Psychology, University of Memphis, 400 Innovation Drive, Memphis, TN, 38112, USA
Daniel B. K. Gabriel, Anna E. Liley & Nicholas W. Simon
Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, 55 N Lake Ave, Worcester, MA, 01655, USA
Timothy G. Freels

Authors

Daniel B. K. Gabriel
View author publications
You can also search for this author in PubMed Google Scholar
Anna E. Liley
View author publications
You can also search for this author in PubMed Google Scholar
Timothy G. Freels
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas W. Simon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Nicholas W. Simon.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gabriel, D.B.K., Liley, A.E., Freels, T.G. et al. Dopamine receptors regulate preference between high-effort and high-risk rewards. Psychopharmacology 238, 991–1004 (2021). https://doi.org/10.1007/s00213-020-05745-z

Download citation

Received: 17 August 2020
Accepted: 01 December 2020
Published: 07 January 2021
Issue Date: April 2021
DOI: https://doi.org/10.1007/s00213-020-05745-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Dopamine receptors regulate preference between high-effort and high-risk rewards

Abstract

Rationale

Objectives

Methods

Results

Conclusions

Similar content being viewed by others

Dopamine Antagonism Decreases Willingness to Expend Physical, But Not Cognitive, Effort: A Comparison of Two Rodent Cost/Benefit Decision-Making Tasks

Striatal dopamine D2 receptors regulate effort but not value-based decision making and alter the dopaminergic encoding of cost

Rodent Models of Adaptive Value Learning and Decision-Making

Introduction

Materials and methods

Subjects

Behavioral apparatus

Instrumental shaping

The effort vs probability economic conflict task

Diminishing probability vs fixed effort economic conflict task

Economic conflict between probability and effort-equivalent delays

Pharmacological manipulations

Experimental timeline

Data analysis

Results

Experiment 1: the economic conflict task

Experiment 2: effect of cocaine on economic conflict

Experiment 3: role of D1 dopamine receptor subtype during economic conflict

Experiment 4: role of D2 dopamine receptor subtype during economic conflict

Experiment 5: role of D2 activation during modified economic conflict task

Discussion

Validation of EvP task

Effects of cocaine on EvP

Parsing the roles of dopamine receptor subtypes in EvP

D1-like receptor modulation

D2-like receptor modulation

Conclusions and implications

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation