On the effects of impulsivity and compulsivity on neural correlates of model-based performance

Dück, Kerstin; Wüllhorst, Raoul; Overmeyer, Rebecca; Endrass, Tanja

doi:10.1038/s41598-024-71692-w

On the effects of impulsivity and compulsivity on neural correlates of model-based performance

Article
Open access
Published: 10 September 2024

Volume 14, article number 21057, (2024)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

On the effects of impulsivity and compulsivity on neural correlates of model-based performance

Download PDF

Kerstin Dück¹,
Raoul Wüllhorst¹,
Rebecca Overmeyer¹ &
…
Tanja Endrass^1,2

298 Accesses
Explore all metrics

Abstract

Impaired goal-directed behavior is associated with a range of mental disorders, implicating underlying transdiagnostic factors. While compulsivity has been linked to reduced model-based (MB) control, impulsivity has rarely been studied in the context of reinforcement learning despite its links to reward processing and cognitive control. This study investigated the neural mechanisms underlying MB control and the influence of impulsivity and compulsivity, using EEG data from 238 individuals during a two-step decision making task. Single-trial analyses revealed a modulation of the feedback-related negativity (FRN), where amplitudes were higher after common transitions and positive reward prediction error (RPE), indicating a valence effect. Meanwhile, enhanced P3 amplitudes after rare transitions and both positive and negative RPE possibly reflect surprise. In a second step, we regressed the mean b values of the effect of RPE on the EEG signals onto self-reported impulsivity and compulsivity and behavioral MB control (w). The effect of RPE on FRN-related activity was mainly associated with higher w scores, linking the FRN to MB control. Crucially, the modulation of the P3 by RPE was negatively associated with compulsivity, pointing to a deficient mental model in highly compulsive individuals.

Expectation of reward differentially modulates executive inhibition

Article Open access 23 August 2019

Lateral prefrontal model-based signatures are reduced in healthy individuals with high trait impulsivity

Article Open access 13 October 2015

Does cognitive control ability mediate the relationship between reward-related mechanisms, impulsivity, and maladaptive outcomes in adolescence and young adulthood?

Article 22 May 2019

Introduction

Learning and decision-making are assumed to be influenced by two reinforcement learning mechanisms: model-free (MF) and model-based (MB) learning. MF learning relies on past action-reward experiences, leading to reward prediction errors (RPE) when expected outcomes do not occur. RPEs alter the value of a choice option and subsequently influence its selection probability^1,2,3,4. MF control facilitates performance with little cognitive effort, but adjustments are slow and it is linked to habitual and inflexible behavior⁵. In contrast, MB learning employs a mental model of a given task, mapping associations of actions and potential outcomes^1,2,3,4. Despite being more computationally demanding, MB learning facilitates goal-directed behavior in complex environments^6,7. Both systems operate in parallel, and the balance between them, computationally quantified by the weighting parameter w, varies with inter-individual differences and contextual factors^1,2,4.

Shifts towards MF learning can lead to suboptimal choices, and an imbalance between MF and MB control has been observed in various mental disorders, e.g. substance use disorder^8,9, schizophrenia¹⁰, gambling disorder¹¹, anorexia nervosa¹², or obsessive–compulsive disorder^13,14. This range of disorders, along with changes in MF and MB control in non-clinical samples¹⁵, suggests that dysfunction in goal-directed behavior may reflect common underlying mechanisms.

Impulsivity and compulsivity are strong candidates for transdiagnostic factors: Impulsivity, characterized by a tendency toward rapid, unplanned actions¹⁶, is associated with reward seeking and behaviors carrying the risk of harm, such as gambling or aggressive actions^17,18,19. Heightened impulsivity is also linked to SUD²⁰ and other disorders characterized by altered reinforcement learning^21,22,23. The associations of impulsivity with cognitive control^24,25,26 and decision making^27,28,29 have been studied extensively. Although research directly linking reinforcement learning and impulsivity is limited, existing studies suggest associated dysfunctions^30,31. Conversely, compulsivity, involving the tendency to repeat actions despite adverse consequences³², is linked to deficits in MB control in the general population³³ and various mental disorders within the compulsivity spectrum³⁴. Impulsivity and compulsivity have been found to overlap significantly in both neurocircuitry³⁵ as well as associations with psychopathology, particularly disorders implicated with altered MB control^36,37. Although these traits have been found to interact at both the psychophysiological³⁸ and symptom levels^39,40, prior studies have typically examined their effects on MB control independently, without considering their possible interplay.

To address this gap, research considering both dimensions within reinforcement learning paradigms is highly warranted to delineate their individual effects and possible interactions. In this study, we employed a sequential decision-making task (two-step task), which is a task designed to investigate MB learning. In its original version², participants are given the choice of two stimuli in a first stage, which then leads them to one of two second stages, each with another distinct stimulus pair to choose between. The transition between the first and second stage is probabilistic. Each choice is linked to one second stage with a high probability (common transition) and to the other second stage with a low probability (rare transition). In the second stage, participants choose between two stimuli resulting in monetary reward or loss, with the outcome changing over time (Fig. 1). MF and MB learners can be differentiated by the extent they consider the transition structure of the task in their first-stage decisions. Since MF learners are assumed to base the perceived value of actions on previous reward, choices in the first stage are likely to be repeated if participants were subsequently rewarded in the second stage, regardless of transition type. In contrast, MB control relies on action-outcome associations, where learners consider both the obtained reward and the transition type from the previous trials. For example, if a reward follows a rare transition, MB learners are more likely switch rather than stay, as they predict a higher likelihood of reward by choosing the alternative first-stage option.

We examined the neural mechanisms of reinforcement learning during the two-step task and their relation to impulsivity and compulsivity, using electroencephalography (EEG). The feedback-related negativity (FRN) occurs fronto-centrally 200–350 ms after feedback presentation and is associated with reward prediction errors^41,42,43, which MF control is based on. Given that impulsive individuals tend to prioritize immediate rewards⁴⁴ and show more MF behavior³⁰, a close relationship to the FRN appears likely. The centroparietal P3, a positive-going component starting around 300 ms after stimulus presentation, shows associations with the probability and relevance of stimuli for building and updating representations of the environment⁴⁵. In sequential decision making paradigms, the P3 should thus be linked to the representation of the task structure. P3 amplitude has been found to vary with transition structure^46,47 and RPE⁴⁷, both integral for a mental model of the task. Task representation deficits reflected in the EEG have been linked to reduced MB planning in individuals with higher compulsivity⁴⁸.

As feedback processing is central for both MF and MB control in reinforcement learning, altered RPE signaling is expected to underlie dysfunctions in decision making. We thus investigated how RPE signals (FRN, P3) in a two-step task were modulated by both impulsivity and compulsivity. Using single-trial analyses, we examined the coupling between RPE and brain activity focusing on the time-windows and locations associated with the FRN and P3. Considering the associations between RPE, FRN, and impulsivity, we hypothesized that the RPE modulation of the FRN would increase with higher impulsivity. Meanwhile, because compulsivity is considered to impede task representation, we expected a diminished effect of RPE on the P3 in individuals with high compulsivity. Lastly, we explored potential interaction effects of impulsivity and compulsivity.

Results

Behavior

We employed a logistic mixed-effect model to investigate whether participants’ first-stage choices showed characteristics of model-free and model-based behavior, and to assess the impact of impulsivity and compulsivity on these tendencies. Characteristics of the previous trial (reward and transition type) and participants’ BIS-11 and OCI-R scores were regressed onto stay probability. As shown in Table 1, participants showed significant effects for reward (β = 1.27, p = < 0.001) and reward*transition interaction (β = 2.01, p = < 0.001), indicating that both MF and MB learning determined behavior. Neither the BIS-11 nor the OCI-R showed significant effects and there was no effect of gender.

Table 1 Fixed effects of the generalized linear mixed-effects model for stay probability in stage 1.

Full size table

Further, we explored the association of RT differences (rare minus common) as a marker for MB learning with impulsivity and compulsivity. Regression analysis revealed neither main nor interaction effects (see supplementary table S2).

EEG data

First-level results: Task effects

The effects of transition type and RPE on second-stage EEG data were analyzed using single trial regression to obtain a regression weight time-course for all electrodes (Fig. 2). For transition type, we observed a significant negative effect starting around 180 ms after feedback onset, indicating more positive EEG signals for rare compared to common trials. Thus, the FRN at FCz was more negative for common trials (β_mean = − 0.62, p = < 0.001), while the P3 was more positive for rare trials (β_mean = − 1.25, p = < 0.001 at Cz). Single-trial regression additionally revealed a significant effect of RPE during the FRN time window (β _mean = 1.63, p = < 0.001 at FCz) characterized by lower (less negative) amplitudes for positive RPEs as compared to negative RPEs. We also observed a significant RPE effect for P3 (β _max = 0.55, p = < 0.001 at Cz), indicating that amplitudes were higher (more positive) for both positive and negative RPE. There also were significant transition*RPE interaction effects in the FRN (β_mean = 0.933, p = < 0.001 at FCz) and P3 (β_mean = 0.88 at 356 ms, p = < 0.001 at Cz) time windows (see supplementary figure S1). As depicted in Fig. 3, the RPE effects were stronger for common trials at FCz and Cz.

Second-level results: Impulsivity and compulsivity effects

We examined how self-reported impulsivity and compulsivity, as well as w, modulated the first-level effect of RPE on the EEG signal during the FRN and P3 in common and rare trials (Table 2). The RPE effect in the FRN window was significantly positively associated with w scores (common trials: β = 0.31, p = 0.023; rare trials: β = 0.33, p = 0.048), indicating larger RPE-related amplitude modulations in individuals with higher w scores (see Fig. 4). We also found a trend level effect of BIS-11 scores on FRN-related activity (β = 0.23, p = 0.073) and an interaction of BIS-11 and w scores (β = − 0.26, p = 0.029) after common transitions. Furthermore, the RPE effect on P3-related activity (see Fig. 5) on rare trials was negatively associated with OCI-R scores (β = − 0.45, p = 0.010), indicating reduced RPE-related amplitude modulations in individuals with higher compulsivity. Additionally, we observed a significant interaction of BIS-11 and w scores (β = 0.39, p = 0.015) for the RPE effect on P3-related activity. To follow up on the interactions of BIS-11 and w, we computed regressions of each predictor (BIS-11 or w) in median split (high vs. low) groups of the other (Fig. 6 and supplementary table S3). For the FRN, both BIS-11 and w showed positive associations with the RPE effect (β _BIS-11 = 0.40 and β _w = 0.49) when the other predictor was held low, which were dampened with high BIS-11 (β _w = 0.17) and high w (β _BIS-11 = − 0.03) scores. Regarding the effect of RPE on P3-related activity, we found a positive link with w in the high impulsivity group (β_w = 0.34), whereas the association in the low impulsivity group was reduced (β_w = − 0.09). BIS-11 showed a negative association in the low w group (β_BIS-11 = − 0.43) which was reversed with high w (β_BIS-11 = 0.22).

Table 2 Robust linear regression results for common and rare trials.

Full size table

When we compared regression coefficients against each other using z-tests, only the effects of w on the FRN in varying levels of BIS-11 differed significantly (see supplementary table S3).

Discussion

In this study, we investigated the association of EEG correlates of outcome processing during a two-step decision-making task with interindividual differences in impulsivity, compulsivity and MB control (w). Single-trial regression revealed that FRN-related activity exhibited higher amplitudes following common transitions and reduced amplitudes after positive RPE. In contrast, the P3 showed enhanced amplitudes after rare transitions and in response to both positive and negative RPE. The effect of RPE on the FRN was positively associated with w scores, while the effect of the RPE on P3 activity showed a negative correlation with OCI-R scores following rare transitions. Furthermore, both the FRN and P3 effects were related to the BIS-11*w interaction.

Enhanced FRN amplitudes following common transitions, compared to rare transitions, point to a relationship between the FRN and MB control: Based on a mental model of the environment, it would be favorable to discern between transition types and rely more heavily on common trials, as they give more information on accumulating reward and overall task output. In contrast, the P3 was enhanced on rare trials, which is in line with the general assumption that the P3 reflects stimulus salience and probability^49,50 as well as context updating⁵¹. Signed RPE further revealed distinct effects on the FRN and P3. Whereas the FRN was less pronounced with positive RPE, the P3 was larger for both positive and negative RPE. Larger FRN amplitudes for neutral and negative RPEs, compared to positive RPEs, suggest that this modulation is driven by positive prediction errors. This supports the view that the FRN reflects feedback valence^17,52, rather than RPE magnitude or „unsigned “ RPE, as some researchers have suggested^43,53,54. However, our findings also align with the concept of the reward positivity (RewP). The RewP is assumed to capture an amplitude modulation attributed to rewards instead of negative or neutral feedback^55,56. While it is easily visible as a difference wave, the RewP has been suggested to be masked by the N2 component^57,58, potentially creating the negative deflection observed following negative feedback and also apparent in our data, which is positively modulated only by reward feedback. In the following discussion, we will thus consider both the FRN and the RewP. Additionally, higher P3 amplitudes after positive and negative RPEs suggest a role of surprise, consistent with the P3’s association with mental task representation⁴⁸. Lastly, the transition*RPE interaction revealed stronger RPE effects following common transitions relative to rare transitions for all investigated ERPs. As described above, tracking RPE after common transitions is more informative for overall reward, suggesting MB control.

In second-level analyses, P3-related activity displayed a modulation by compulsivity: individuals with high OCI-R scores showed a reduced influence of RPE on the P3. Notably, this effect appeared only after rare transitions, pointing to a reduced use of RPEs from these trials to update choice values. As this process relies on integrating transition information and RPE and is based on a mental task representation, it may indicate impaired MB control. Supporting evidence for an association of high compulsivity with deficient mental models comes from several neuroimaging studies in compulsivity-related disorders⁵⁹: For example, smokers processed prediction errors from fictive outcomes, yet did not adapt their behavior accordingly⁶⁰, while alcohol-dependent individuals showed reduced activation related to the updating of different choice options⁶¹.

Our findings align with those of Seow et al.⁴⁸, who observed that P3, alpha power and RT after second-stage stimulus presentation were all sensitive to transition type, although only the latter two were associated with MB control and compulsivity. The authors propose that reduced orienting processes (reflected in RT and alpha power) indicate a deficient representation of transition structure, pointing to an impaired mental model. This relationship between behavioral and neural responses to transition type and mental task representation appears more visible in stimulus-locked data. Meanwhile, our data suggests neural deficits in the mental model after feedback. It is possible that compulsivity affects MB control at both levels, with stimulus- and feedback-locked analyses offering complementary insights into these deficits.

MB control was also relevant for FRN/RewP-related activity, as we found that higher w scores modulated the effect of RPE. This is consistent with the notion that w reflects a tendency toward MB control, because RPE is not merely based on feedback valence, but instead requires a mental task representation to form predictions. According to this mental model, tracking RPE in common trials is more crucial because it impacts the overall points obtained. Thus, the FRN/RewP appears as both a reflection of the integration of RPE into MB learning as well as reward processing related to MF learning. This overlap is consistent with neuroimaging finding, where MF and MB signals are not entirely distinct; e.g. with the ventral striatum, a region associated with reward processing¹⁷, has been shown to reflect both types of control².

BIS-11 scores showed a trend-level effect on RPE modulation, possibly because impulsivity is associated with reward processing^17,18,63 and might influence RPE to some extent, it primarily impacts MF rather than MB learning, thus limiting its effect on RPE modulation. However, the regression effects of BIS-11 and w were qualified by an interaction on common trials, which was followed up by regression analyses of each predictor in median split (high vs. low) groups of BIS-11 or w scores, respectively. Results showed that the effect of w differed significantly between levels of BIS-11, with a positive relationship between w and the FRN/RewP only when impulsivity was low. Impulsivity has been associated with reduced goal-directed control^64,65, whereas high w, as an indicator of model-basedness, is related to reduced MF learning. The interaction effect suggests that impulsivity is more linked to MF control and that the FRN/RewP reflects not only MF, but also MB control, consistent with findings that the RewP signals the integration of complex action sequences⁶⁶. Post-hoc analyses also revealed changes in the effects of BIS-11 on the FRN/RewP and both predictors on the P3 that further depict the BIS-11*w interaction. Although differences in regression scores from median split groups did not reach significance, the interaction effects point to an interplay between impulsivity and markers of goal-directed control, requiring further research to disentangle their specific effects.

Behavioral analyses focused on participants’ tendency to repeat a first-stage decision. The linear mixed models revealed effects of the previous trial’s outcome and transition (reward and reward*transition interaction), pointing to the well-established mixture of MF and MB control^46,48,67. Yet, unlike some previous studies, we did not observe interaction effects with BIS-11 and OCI-R scores. For example, Deserno et al.³⁰, reported the reward effects on stay probabilities to vary with impulsivity, but their study compared extreme groups of impulsivity within a smaller, less homogeneous sample, which may have increased effects. Raio et al.³¹ found impulsivity scores to affect choice behavior only in the second stage of a two-step task, where participants can be influenced by previous reward, but MF and MB control are no longer differentiated. Thus, these results on decision-making deficits may not apply to our analyses. Seow et al.⁴⁸ found that individuals with higher compulsivity scores had a reduced prolongation of reaction times after rare vs. common transitions , which was interpreted as impaired mental task representation. We could not replicate this result in our study. Possibly, effects of impulsivity and compulsivity on outcome processing on the neural level do not translate directly into participants’ first-stage choice behavior. This might be due to compensatory mechanisms, e.g. high cognitive effort. Outside of a controlled laboratory setting, where various internal and external factors challenge resources, individuals might find it more difficult to maintain MB control.

One limitation of our study is the potential impact of gender. Literature indicates that impulsivity scores vary by gender⁶⁸ and there is evidence that gender moderates the association of trait impulsivity and risk behavior^69,70. In reinforcement learning, gender differences have been found behaviorally⁷¹ and in the relationship between stress and cortico-striatal brain function⁷², suggesting possible confounding effects. However, since our study did not find a gender effect at the behavioral level, we did not include gender in our EEG analyses to maintain models parsimonious. Future studies are warranted to address this gap in reinforcement learning studies.

In conclusion, we found feedback processing in the EEG to be modulated by interindividual differences. As expected, RPE had less influence on the P3 in highly compulsive individuals, which we interpreted as a deficient mental model. However, further research is needed on the nature of the mental model deficits, i.e. if the model itself is not built or not applied properly. Although impulsivity is often connected to poor decision-making outside of the laboratory, the association in our task was only weak. More research to bridge this gap to “real-life” deficits is thus highly warranted. Still, based on our large community sample, our results may help explain impaired decision-making as seen in reduced MB control in a specific task as well as the everyday behavior that characterizes compulsivity in both clinical and healthy populations.

Methods

Participants

Two-hundred fifty-three participants from the general Dresden area participated in the study. Inclusion criteria were age 18–45 years, native German speakers, and normal or corrected-to-normal vision. Participants were excluded if they reported a history of neurological disorder or head trauma; lifetime diagnosis of bipolar disorder, borderline personality disorder, psychotic episodes, or severe alcohol use disorder; acute eating disorder or severe episode of major depression; psychotropic medication within the last three months; lifetime use of illicit substances more than twice a year or lifetime use of cannabis more than twice a month. We excluded participants from analysis due to poor task compliance (N = 11), technical EEG recording errors (N = 3), and retrospective identification of exclusion criteria (cannabis use, N = 1). Thus, the final sample consisted of 238 participants (50% female; age M = 25.04, SD = 4.84; education level: 95% high school or higher).

The project has been approved by the ethics committee at the University Hospital Carl Gustav Carus, at TU Dresden (EK 372092017) and was conducted in accordance with the ethical guidelines of the Declaration of Helsinki. All participants gave informed consent and received financial compensation (80–100 €) or course credit for participation. The study is part of a larger research project which assessed different cognitive control functions in relation to impulsivity and compulsivity (https://osf.io/ywnze/).

Procedure and measures

The two-step task was completed as part of an EEG session in the lab. Additionally, participants completed other EEG tasks as well as a neuropsychological test battery at a first lab appointment, and ecological momentary assessment, which will not be reported here.

Two-step task

Participants performed a modified two-step task^2,7, consisting of two subsequent decision making stages. First, they were asked to choose one of two first-stage stimuli (cartoon drawings of spaceships), which would then lead them to one of two possible second stages (planets). Each first-stage stimulus was associated with a second stage with a transition probability of 80% (common transition) or 20% (rare transition), respectively (see Fig. 1). In the second stage, participants were presented with a set of two stimuli (aliens) specific to the particular stage, and again instructed to choose one, resulting in a number of points added to or subtracted from their total count. The reward probability at stage two followed a random walk with reflective bounds at + 5 and − 4 points. Participants were told that choosing one alien meant asking it to dig for space treasure. The aliens then presented them with up to 5 pieces of space treasure (adding points) or up to 4 pieces of “anti-matter” (subtracting points), or nothing (no points). The goal was to collect as much treasure, i.e. points, as possible, which would be transformed into a bonus of up to 5 € at the end of the task.

The task was composed of 500 trials split into four blocks. First and second-stage stimuli remained on screen until the response (left or right index fingers; max. response window: 2000 ms), the selected option was then marked for 500–800 ms. After the second-stage response, the outcome was displayed for 1000 ms as the respective number of icons for treasure or anti-matter, together with a bar indicating the subject’s total point count. A black screen was shown for 300–800 ms between trials.

Participants were instructed that each spaceship had a fixed preference for one planet and that the outcome of each alien would change over time. They familiarized themselves with the transition and reward structure of the task before completing a block of 25 of practice trials.

Personality scales

Impulsivity

We used a German translation of the 11th version of the Barratt Impulsiveness Scale (BIS-11)⁷³. In 30 self-report items, the questionnaire tests for attentional, motor, and non-planning impulsiveness and yields a sum score with good internal consistency (α = 0.83)⁷⁴.

Compulsivity

Compulsivity was operationalized using the Obsessive–Compulsive Inventory-Revised (OCI-R)^75,76. The self-report questionnaire measures obsessive–compulsive symptom severity (washing, checking, doubting, ordering, obsessing [i.e., having obsessional thoughts], hoarding, and mental neutralizing). We again used the sum score, which has shown good internal consistency⁷⁵. Both BIS-11 and OCI-R scores were z-standardized for further analyses.

Data acquisition and analysis

Data was analyzed with MATLAB R2021a⁷⁷ and the EEGlab toolbox (version 14.1.2b)⁷⁸ using the high performance computing system (HPC) at the TU Dresden. Further regression analyses were performed with R⁷⁹.

Behavioral data and computational modelling

First-stage choice data

Participants were excluded from all analyses if they showed random choice behavior (N = 11, see sample description), i.e., if the probability to repeat their last first-stage choice (stay probability) was not positively associated with last trial’s reward or reward*transition interaction in a logistic regression model.

Behavioral data was further analyzed using the R package lme4⁸⁰. We tested to what extent the tendency of each participant to repeat their last first-stage choice (stay: 1 | switch: 0) was explained by the transition type (common: 1 | rare: − 1) and reward (yes: 1 | no: − 1) of the previous trial via a logistic mixed-effect model. MF learning, being solely reward-driven, is signified by the tendency to repeat choices after a win and to switch after a loss. In contrast, MB behavior takes the transition structure into account, signified by the tendency to repeat the response after a win if the transition was common, but to switch after a win if the transition was rare. Thus, individual β weights for reward provided an estimate for MF behavior, whereas the interaction between reward and transition type indicated model-based behavior.

It was of particular interest how these terms were associated with BIS-11 and OCI-R scores, to examine how impulsivity and compulsivity affected behavior. Individual intercepts and regression weights for transition, reward, and the transition*reward interaction were included as random effects to allow for variance across participants. In R syntax, the model was: stay ~ transition * reward * BIS-11 * OCI-R + (transition * reward | subject).

Based on findings from Seow et al.⁴⁸, we also explored the reaction time (RT) difference for common and rare trials. Surprise after rare transitions, causing an orienting process indicated in longer RT, is based on a mental representation of the transition structure. It can thus be seen as a marker for MB learning. The relationship between RT difference (median RT in rare—median RT in common trials), impulsivity and compulsivity was examined in robust regression (RT_delta ~ BIS-11*OCI-R).

Computational modelling

Following Kool et al.⁷, the task involved two stages with three possible states s (stage 1: s_A; stage 2: s_B or s_C), and two possible actions a (a_A and a_B). All models learn to maximize the value Q (s, a). At a given trial t, states are denoted as s_1,t (always s_A) and s_2,t (s_B or s_C), actions as a_1,t and a_2,t, and rewards as r_1,t (always equal to zero) and r_2,t.

Model-free. MF agents solve the task according to the SARSA(λ) temporal difference learning algorithm⁸¹, such that at each stage i and trial t

$${Q}_{MF}\left(s,a\right)= {Q}_{MF}\left(s,a\right)+ \alpha {\delta }_{i,t}{e}_{i,t}\left(s,a\right).$$

Here, α denotes the free learning rate parameter (indicating how fast values are updated), δ_i,t denotes the RPE, and e_i,t(s, a) denotes the free eligibility trace parameter.

As r_1,t is always equal to zero, the first-stage RPE depends on the second stage action:

$${\delta }_{1,t}= {Q}_{MF}\left({s}_{2,t},{a}_{2,t}\right)- {Q}_{MF}({s}_{1,t},{a}_{1,t})$$

The second-stage RPE depends on r_2,t:

$${\delta }_{2,t}= {r}_{2,t}-{Q}_{MF}\left({s}_{2,t},{a}_{2,t}\right).$$

The eligibility trace equals 0 at the beginning of each trial and is updated before the Q value according to

$${e}_{i,t}\left({s}_{i,t},{a}_{i,t}\right)={e}_{i-1,t}\left({s}_{i,t},{a}_{i,t}\right)+1.$$

First- and second-stage value updates occurred at the second stage. Here, prediction errors of first-stage values were weighted by the eligibility trace decay (also referred to as λ, which, if equal to zero, indicates that only values of the current stage receive an update).

Model-based. MB agents extend the model-free algorithm at the first stage by taking into account the transition structure P linking the first and second stages:

$${Q}_{MB}\left({s}_{A},{a}_{j}\right)=P\left({s}_{B} | {s}_{A},{a}_{j}\right)\underset{a\in \left\{{a}_{A},{a}_{B}\right\}}{\text{max}}{Q}_{MF}\left({s}_{B},a\right)+P\left({s}_{C} | {s}_{A},{a}_{j}\right)\underset{a\in \left\{{a}_{A},{a}_{B}\right\}}{\text{max}}{Q}_{MF}\left({s}_{C},a\right).$$

At the second stage, model-free and model-based agents perform equivalent updates, such that Q_MF = Q_MB.

Hybrid. Hybrid agents arbitrate between the Q values according to a weighting parameter w:

$${Q}_{net}\left({s}_{A},{a}_{j}\right)=w{Q}_{MB}\left({s}_{A},{a}_{j}\right)+\left(1-w\right){Q}_{MF}\left({s}_{A},{a}_{j}\right).$$

Decision rule. Finally, Q values were subjected to a softmax function to determine choice probabilities:

$$P\left( {a_{i,t} = a|s_{i,t} } \right) = \frac{{e^{{\beta \left( {Q_{net} \left( {s_{i,t} ,a} \right) + \pi *rep\left( a \right) + \rho *resp\left( a \right)} \right)}} }}{{\sum {e^{{\beta \left( {Q_{net} \left( {s_{i,t} ,a^{\prime}} \right) + \pi *rep\left( {a^{\prime}} \right) + \rho *resp\left( {a^{\prime}} \right)} \right)}} } }}.$$

Here, β indicates the stochasticity of behavior, π a choice stickiness pareter (multiplied by rep(a) = 1 if first-stage action a was chosen on the current as well as the previous trial, otherwise zero), and ρ a response stickiness parameter (multiplied by resp(a) = 1 if the first-stage action a involved the same response key on the current as well as the previous trial, otherwise zero). We compared MF, MB and hybrid models excluding π and ρ (pure model), including π (+ choice stickiness model), and including π and ρ (+ choice + response stickiness model; see Supplement for more detail on parameter estimation and model fitting). The hybrid model including choice stickiness proved to have the most parsimonious fit, indicated by the Bayesian Information Criterion, and was used to retrieve individual behavioral task parameters, e.g., the weighting parameter w.

EEG recording and data reduction

EEG was recorded with Ag/AgCl electrodes from 61 sites of an equidistant electrode montage (Easycap GmbH, Breitbrunn, Germany) as well as from three external positions: approximately 2 cm below each eye to record eye movements and at the lower back to record the electrocardiogram. The EEG was amplified with two 32-channel BrainAmp amplifiers (Brain Products GmbH, Munich, Germany), recorded at a sampling rate of 500 Hz, and referenced to an electrode next to FCz. Offline, continuous data was filtered (0.1–30 Hz). After submitting data to an adaptive mixture independent component analysis (AMICA), we employed visual inspection together with the ICLabel toolbox⁸² to remove components containing eye-movement and cardioballistic artifacts. EEG data was then re-referenced to an average reference. Epochs of − 200 to 1000 ms around second-stage outcome presentation were subjected to adaptive artifact rejection, removing epochs containing deviations > 4 SDs of the mean probability distribution with the constraint to remove at least one and a maximum of 5% of trials or to otherwise adapt the SD threshold in steps of 0.1⁸³. Epochs underwent a baseline correction (− 200 to 0 ms prior to outcome presentation) and trials including preliminary responses (reaction time < 100 ms) in either stage one or two were removed.

EEG analyses

First-level analyses

Electrophysiological data was subjected to single-trial analyses to quantify the relationship between EEG activity and trial-wise characteristics within our computational model. Second-stage feedback-locked data were used to investigate transition type and RPE. RPE was signed, i.e. could be positive or negative, combining valence and magnitude. We regressed EEG activity at each electrode and time point on trial type (common or rare) and RPE, using robust regression (EEG ~ Transition + RPE + Transition*RPE). The resulting temporo-spatial maps of b values per subject were then averaged over subjects to investigate whether transition type and RPE significantly accounted for variance in EEG activity. As we specifically assumed the FRN and P3 to reflect transition and RPE effects, analyses focused on the electrodes and time-windows corresponding to these event-related potentials (ERP). Search locations and intervals for each component were based on visual inspection of the raw EEG (grand averaged over all subjects). The FRN was thus determined as the negative EEG peak at FCz in a window of 250–350 ms after stimulus onset (peak latency: 294 ms) and the P3 as the positive peak between 330–430 ms at Cz (peak latency: 370 ms). B values were subjected to two-tailed one-sample t-tests against zero, employing false discovery rate⁸⁴ (FDR) to correct for multiple comparisons.

Second-level analyses

To examine how first-level RPE signals varied as a function of impulsivity and compulsivity as well as the weighting of MF and MB control (w), we examined their effect on first-level b values for the RPE effect. Since our first-level analyses revealed significant interaction effects of RPE and transition type at FCz, and Cz, suggesting that RPE is processed differently in common and rare trials, we conducted our further analyses separately within each transition type. In order to isolate the influence of impulsivity, compulsivity and w on distinct, RPE-related processes, analyses focused on the ERPs of interest (see above). We computed the mean first-level effect of RPE per subject by averaging the RPE b values in windows of − 25 ms and + 25 ms around the FRN- and P3-related peaks at FCz and Cz, respectively. These average RPE effects then served as dependent variables in robust linear regression models with z-scored impulsivity, compulsivity and w (weighting parameter) as well as their interactions as simultaneous predictors (mean b values ~ BIS-11 + OCI-R + w + BIS-11*OCI-R + BIS-11*w + OCI-R*w + BIS-11*OCI-R*w). Significant BIS-11*w interactions were followed up by regression analyses within high and low impulsivity and w score groups (based on median split). Pairs of regression coefficients were then compared in z-tests. Focusing on single values for the effect of RPE on the FRN and P3 allowed us to explore different regressors and their interactions without losing power due to correcting for multiple tests.

Data availability

Data and analysis routines are accessible under https://osf.io/vytdr/.

References

Gläscher, J., Daw, N., Dayan, P. & O’Doherty, J. P. States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron 66, 585–595 (2010).
Article PubMed PubMed Central Google Scholar
Daw, N. D., Gershman, S. J., Seymour, B., Dayan, P. & Dolan, R. J. Model-based influences on humans’ choices and striatal prediction errors. Neuron 69, 1204–1215 (2011).
Article PubMed PubMed Central CAS Google Scholar
Daw, N. D., Niv, Y. & Dayan, P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat. Neurosci. 8, 1704–1711 (2005).
Article PubMed CAS Google Scholar
Dayan, P. & Niv, Y. Reinforcement learning: The good, the bad and the ugly. Curr. Opin. Neurobiol. 18, 185–196 (2008).
Article PubMed CAS Google Scholar
Gillan, C. M., Otto, A. R., Phelps, E. A. & Daw, N. D. Model-based learning protects against forming habits. Cognit. Affect. Behav. Neurosci. 15, 523–536 (2015).
Article Google Scholar
Doll, B. B., Simon, D. A. & Daw, N. D. The ubiquity of model-based reinforcement learning. Curr. Opin. Neurobiol. 22, 1075–1081 (2012).
Article PubMed PubMed Central CAS Google Scholar
Kool, W., Cushman, F. A. & Gershman, S. J. When does model-based control pay off?. Plos Comput. Biol. 12, e1005090 (2016).
Article ADS PubMed PubMed Central Google Scholar
Sebold, M. et al. Model-based and model-free decisions in alcohol dependence. Neuropsychobiology 70, 122–131 (2014).
Article PubMed CAS Google Scholar
Doñamayor, N. et al. Goal-directed and habitual control in human substance use: State of the art and future directions. Neuropsychobiology 81, 403–417 (2022).
Article PubMed Google Scholar
Culbreth, A. J., Westbrook, A., Daw, N. D., Botvinick, M. & Barch, D. M. Reduced model-based decision-making in Schizophrenia. J. Abnorm. Psychol. 125, 777–787 (2016).
Article PubMed PubMed Central Google Scholar
Wyckmans, F. et al. The modulation of acute stress on model-free and model-based reinforcement learning in gambling disorder. J. Behav. Addict. 11, 831–844 (2022).
Article PubMed PubMed Central Google Scholar
Daw, N. Model-based and model-free learning in anorexia nervosa and other disorders. Biol. Psychiatry 87, S20 (2020).
Article Google Scholar
Ruan, Z. et al. Impairment of arbitration between model-based and model-free reinforcement learning in obsessive–compulsive disorder. Front. Psychiatry 14, 1162800 (2023).
Article PubMed PubMed Central Google Scholar
Voon, V. et al. Motivation and value influences in the relative balance of goal-directed and habitual behaviours in obsessive-compulsive disorder. Transl. Psychiatry 5, e670–e670 (2015).
Article PubMed PubMed Central CAS Google Scholar
Gillan, C. M. et al. Comparison of the association between goal-directed planning and self-reported compulsivity vs obsessive-compulsive disorder diagnosis. JAMA Psychiat. 77, 77–85 (2020).
Article Google Scholar
Moeller, F. G., Barratt, E. S., Dougherty, D. M., Schmitz, J. M. & Swann, A. C. Psychiatric aspects of impulsivity. Am. J. Psychiat. 158, 1783–1793 (2001).
Article PubMed CAS Google Scholar
Pagnoni, G., Zink, C. F., Montague, P. R. & Berns, G. S. Activity in human ventral striatum locked to errors of reward prediction. Psychol. Rev. 5, 97–98 (2002).
CAS Google Scholar
Heyes, S. B. et al. Impulsivity and rapid decision-making for reward. Front. Psychol. 3, 153 (2012).
Google Scholar
Sharma, L., Markon, K. E. & Clark, L. A. Toward a theory of distinct types of “impulsive” behaviors: A meta-analysis of self-report and behavioral measures. Psychol. Bull. 140, 374–408 (2014).
Article PubMed Google Scholar
Lee, R. S. C., Hoppenbrouwers, S. & Franken, I. A systematic meta-review of impulsivity and compulsivity in addictive behaviors. Neuropsychol. Rev. 29, 14–26 (2019).
Article PubMed Google Scholar
Hoptman, M. J. Impulsivity and aggression in schizophrenia: a neural circuitry perspective with implications for treatment. CNS Spectr 20, 280–286 (2015).
Article PubMed PubMed Central Google Scholar
Prochazkova, L. et al. Unpacking the role of self-reported compulsivity and impulsivity in obsessive-compulsive disorder. CNS Spectrums 23, 51–58 (2018).
Article PubMed Google Scholar
Ioannidis, K., Hook, R., Wickham, K., Grant, J. E. & Chamberlain, S. R. Impulsivity in gambling disorder and problem gambling: A meta-analysis. Neuropsychopharmacology 44, 1354–1361 (2019).
Article PubMed PubMed Central Google Scholar
Dalley, J. W., Everitt, B. J. & Robbins, T. W. Impulsivity, compulsivity, and top-down cognitive control. Neuron 69, 680–694 (2011).
Article PubMed CAS Google Scholar
Dalley, J. W. & Robbins, T. W. Fractionating impulsivity: Neuropsychiatric implications. Nat. Rev. Neurosci. 18, 158–171 (2017).
Article PubMed CAS Google Scholar
Overmeyer, R. & Endrass, T. Disentangling associations between impulsivity, compulsivity and performance monitoring. ESS Open Archive https://doi.org/10.22541/au.168780067.77323886/v1 (2023).
Article Google Scholar
Kim, S. & Lee, D. Prefrontal cortex and impulsive decision making. Biol. Psychiatry 69, 1140–1146 (2011).
Article PubMed Google Scholar
Wise, R. J., Phung, A. L., Labuschagne, I. & Stout, J. C. Differential effects of social stress on laboratory-based decision-making are related to both impulsive personality traits and gender. Cogn. Emot. 29, 1475–1485 (2015).
Article PubMed Google Scholar
Petzold, J. et al. Baseline impulsivity may moderate L-DOPA effects on value-based decision-making. Sci. Rep. 9, 5652 (2019).
Article ADS PubMed PubMed Central Google Scholar
Deserno, L. et al. Lateral prefrontal model-based signatures are reduced in healthy individuals with high trait impulsivity. Transl. Psychiatry 5, e659–e659 (2015).
Article PubMed PubMed Central CAS Google Scholar
Raio, C. M., Konova, A. B. & Otto, A. R. Trait impulsivity and acute stress interact to influence choice and decision speed during multi-stage decision-making. Sci. Rep.-UK 10, 7754 (2020).
Article ADS CAS Google Scholar
Luigjes, J. et al. Defining compulsive behavior. Neuropsychol. Rev. 29, 4–13 (2019).
Article PubMed PubMed Central Google Scholar
Gillan, C. M., Kosinski, M., Whelan, R., Phelps, E. A. & Daw, N. D. Characterizing a psychiatric symptom dimension related to deficits in goal-directed control. Elife 5, e11305 (2016).
Article PubMed PubMed Central Google Scholar
Voon, V., Reiter, A., Sebold, M. & Groman, S. Model-based control in dimensional psychiatry. Biol. Psychiatry 82, 391–400 (2017).
Article PubMed Google Scholar
Fineberg, N. A. et al. New developments in human neurocognition: Clinical, genetic, and brain imaging correlates of impulsivity and compulsivity. CNS Spectrums 19, 69–89 (2014).
Article PubMed PubMed Central Google Scholar
Chamberlain, S. R. et al. Fractionation of impulsive and compulsive trans-diagnostic phenotypes and their longitudinal associations. Aust. N. Z. J. Psychiatry 53, 896–907 (2019).
Article PubMed PubMed Central Google Scholar
Robbins, T. W., Gillan, C. M., Smith, D. G., de Wit, S. & Ersche, K. D. Neurocognitive endophenotypes of impulsivity and compulsivity: Towards dimensional psychiatry. Trends Cogn. Sci. 16, 81–91 (2012).
Article PubMed Google Scholar
Overmeyer, R., & Endrass, T. Disentangling associations between impulsivity, compulsivity, and performance monitoring. Psychophysiology e14539 (2024).
Liu, C. et al. Distress-driven impulsivity interacts with trait compulsivity in association with problematic drinking: A two-sample study. Front. Psychiatry 13, 938275 (2022).
Article PubMed PubMed Central Google Scholar
Albertella, L. et al. The influence of trait compulsivity and impulsivity on addictive and compulsive behaviors during COVID-19. Front. Psychiatry 12, 634583 (2021).
Article PubMed PubMed Central Google Scholar
Miltner, W. H. R., Braun, C. H. & Coles, M. G. H. Event-related brain potentials following incorrect feedback in a time-estimation task: Evidence for a “generic” neural system for error detection. J. Cognit. Neurosci. 9, 788–798 (1997).
Article CAS Google Scholar
Walsh, M. M. & Anderson, J. R. Learning from experience: Event-related potential correlates of reward processing, neural adaptation, and behavioral choice. Neurosci. Biobehav. Rev. 36, 1870–1884 (2012).
Article PubMed PubMed Central Google Scholar
Sambrook, T. D. & Goslin, J. A neural reward prediction error revealed by a meta-analysis of ERPs using great grand averages. Psychol. Bull. 141, 213–235 (2015).
Article PubMed Google Scholar
Otto, A. R., Markman, A. B. & Love, B. C. Taking more. Now. Soc. Psychol. Pers. Sci. 3, 131–138 (2012).
Article Google Scholar
Donchin, E. & Coles, M. G. H. Is the P300 component a manifestation of context updating?. Behav. Brain Sci. 11, 357–374 (1988).
Article Google Scholar
Eppinger, B., Walter, M. & Li, S.-C. Electrophysiological correlates reflect the integration of model-based and model-free decision information. Cognit. Affect. Behav. Neurosci. 17, 406–421 (2017).
Article Google Scholar
Wurm, F., Ernst, B. & Steinhauser, M. The influence of internal models on feedback-related brain activity. Cognit. Affect. Behav. Neurosci. 20, 1070–1089 (2020).
Article Google Scholar
Seow, T. X. F. et al. Model-based planning deficits in compulsivity are linked to faulty neural representations of task structure. J. Neurosci. 41, 6539–6550 (2021).
Article PubMed PubMed Central CAS Google Scholar
Polich, J. Task difficulty, probability, and inter-stimulus interval as determinants of P300 from auditory stimuli. Electroencephalogr. Clin. Neurophysiol. Evoked Potentials Sect. 68, 311–320 (1987).
Article CAS Google Scholar
Martens, S., Elmallah, K., London, R. & Johnson, A. Cuing and stimulus probability effects on the P3 and the AB. Acta Psychol. 123, 204–218 (2006).
Article Google Scholar
Polich, J. Updating P300: An integrative theory of P3a and P3b. Clin. Neurophysiol. 118, 2128–2148 (2007).
Article PubMed PubMed Central Google Scholar
von Borries, A. K. L., Verkes, R. J., Bulten, B. H., Cools, R. & de Bruijn, E. R. A. Feedback-related negativity codes outcome valence, but not outcome expectancy, during reversal learning. Cogn. Affect. Behav. Neurosci. 13, 737–746 (2013).
Article Google Scholar
Rawls, E. et al. Feedback-related negativity and frontal midline theta reflect dissociable processing of reinforcement. Front. Hum. Neurosci. 13, 452 (2020).
Article PubMed PubMed Central Google Scholar
Gu, R. et al. Valence and magnitude ambiguity in feedback processing. Brain Behav 7, e00672 (2017).
Article PubMed PubMed Central Google Scholar
Holroyd, C. B., Pakzad-Vaezi, K. L. & Krigolson, O. E. The feedback correct-related positivity: Sensitivity of the event-related brain potential to unexpected positive feedback. Psychophysiology 45, 688–697 (2008).
Article PubMed Google Scholar
Proudfit, G. H. The reward positivity: From basic research on reward to a biomarker for depression. Psychophysiology 52, 449–459 (2015).
Article PubMed Google Scholar
Baker, T. E. & Holroyd, C. B. Which way do I go? Neural activation in response to feedback and spatial processing in a virtual T-maze. Cereb. Cortex 19, 1708–1722 (2009).
Article PubMed Google Scholar
Holroyd C. A note on the oddball N200 and the feedback ERN. In: Errors, conflicts, and the brain: Current opinions on performance monitoring. Max Planck Institute, 2004, pp 211–218.
Voon, V. et al. Disorders of compulsivity: A common bias towards learning habits. Mol. Psychiatr. 20, 345–352 (2015).
Article CAS Google Scholar
Chiu, P. H., Lohrenz, T. M. & Montague, P. R. Smokers’ brains compute, but ignore, a fictive error signal in a sequential investment task. Nat. Neurosci. 11, 514–520 (2008).
Article PubMed CAS Google Scholar
Reiter, A. M. F. et al. Behavioral and neural signatures of reduced updating of alternative options in alcohol-dependent patients during flexible decision-making. J Neurosci 36, 10935–10948 (2016).
Article PubMed PubMed Central CAS Google Scholar
Polich, J. & Criado, J. R. Neuropsychology and neuropharmacology of P3a and P3b. Int. J. Psychophysiol. 60, 172–185 (2006).
Article PubMed Google Scholar
Jimura, K., Chushak, M. S. & Braver, T. S. Impulsivity and self-control during intertemporal decision making linked to the neural dynamics of reward value representation. J. Neurosci. 33, 344–357 (2013).
Article PubMed PubMed Central CAS Google Scholar
Hogarth, L., Chase, H. W. & Baess, K. Impaired goal-directed behavioural control in human impulsivity. Q. J. Exp. Psychol. 65, 305–316 (2012).
Article Google Scholar
Cools, R. et al. Tryptophan depletion disrupts the motivational guidance of goal-directed behavior as a function of trait impulsivity. Neuropsychopharmacology 30, 1362–1373 (2005).
Article PubMed CAS Google Scholar
Rommerskirchen, L., Lange, L. & Osinsky, R. The reward positivity reflects the integrated value of temporally threefold-layered decision outcomes. Psychophysiology 58, e13789 (2021).
Article PubMed Google Scholar
Ruel, A., Bolenz, F., Li, S.-C., Fischer, A. & Eppinger, B. Neural evidence for age-related deficits in the representation of state spaces. Cereb. Cortex https://doi.org/10.1093/cercor/bhac171 (2022).
Article Google Scholar
Cyders, M. A. Impulsivity and the sexes. Assessment 20, 86–97 (2013).
Article PubMed Google Scholar
Stoltenberg, S. F., Batien, B. D. & Birgenheir, D. G. Does gender moderate associations among impulsivity and health-risk behaviors?. Addict. Behav. 33, 252–265 (2008).
Article PubMed Google Scholar
Dir, A. L., Coskunpinar, A. & Cyders, M. A. A meta-analytic review of the relationship between adolescent risky sexual behavior and impulsivity across gender, age, and race. Clin. Psychol. Rev. 34, 551–562 (2014).
Article PubMed Google Scholar
Evans, K. L. & Hampson, E. Sex-dependent effects on tasks assessing reinforcement learning and interference inhibition. Front. Psychol. 6, 1044 (2015).
Article PubMed PubMed Central Google Scholar
Zühlsdorff, K. et al. Sex-dependent effects of early life stress on reinforcement learning and limbic cortico-striatal functional connectivity. Neurobiol. Stress 22, 100507 (2023).
Article PubMed Google Scholar
Patton, J. H., Stanford, M. S. & Barratt, E. S. Factor structure of the barratt impulsiveness scale. J. Clin. Psychol. 51, 768–774 (1995).
Article PubMed CAS Google Scholar
Stanford, M. S. et al. Fifty years of the Barratt Impulsiveness Scale: An update and review. Pers. Indiv. Differ. 47, 385–395 (2009).
Article Google Scholar
Gönner, S., Leonhart, R. & Ecker, W. Das Zwangsinventar OCI-R—die deutsche Version des Obsessive-Compulsive Inventory-Revised—Ein kurzes Selbstbeurteilungsinstrument zur mehrdimensionalen Messung von Zwangssymptomen [The German version of the obsessive-compulsive inventory-revised: A brief self-report measure for the multidimensional assessment of obsessive-compulsive symptoms]. Psychotherapie Psychosomatik Medizinische Psychologie 57, 395–404 (2007).
Article PubMed Google Scholar
Foa, E. B. et al. The obsessive-compulsive inventory: Development and validation of a short version. Psychol. Assess. 14, 485–496 (2002).
Article PubMed Google Scholar
Inc TM. MATLAB. Natick, Massachusetts, 2021.
Delorme, A. & Makeig, S. EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21 (2004).
Article PubMed Google Scholar
Team RC. R: A language and environment for statistical computing. R Foundation for Statistical Computing: Vienna, Austria, 2022. https://www.R-project.org/.
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Article Google Scholar
Rummery, G. & Niranjan, M. On-line Q-learning using connectionist systems (Cambridge University, 1994).
Google Scholar
Pion-Tonachini, L., Kreutz-Delgado, K. & Makeig, S. ICLabel: An automated electroencephalographic independent component classifier, dataset, and website. Neuroimage 198, 181–197 (2019).
Article PubMed Google Scholar
Fischer, A. G. & Ullsperger, M. Real and fictive outcomes are processed differently but converge on a common adaptive mechanism. Neuron 79, 1243–1255 (2013).
Article PubMed CAS Google Scholar
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Methodol. 57, 289–300 (1995).
Article MathSciNet Google Scholar

Download references

Acknowledgements

This study was supported by the collaborative research center (CRC 940, C6). Funding organizations had no role in the preparation of the manuscript or the decision for publication. Computing time for data analysis was provided through the Center for Information Services and the High Performance Computing System at TU Dresden. We thank Julia Berghäuser for her contribution in data collection and preliminary data analysis. We further thank our student assistants for their support in data collection.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Faculty of Psychology, Chair for Addicition Research, Technische Universität Dresden, 01062, Dresden, Germany
Kerstin Dück, Raoul Wüllhorst, Rebecca Overmeyer & Tanja Endrass
Neuroimaging Center, Technische Universität Dresden, 01062, Dresden, Germany
Tanja Endrass

Authors

Kerstin Dück
View author publications
You can also search for this author in PubMed Google Scholar
Raoul Wüllhorst
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca Overmeyer
View author publications
You can also search for this author in PubMed Google Scholar
Tanja Endrass
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.D. performed data analysis and wrote the manuscript text. R.W. supervised data analysis and wrote the manuscript parts on computational modelling procedure. R.O. conducted data collection. T.E. conceived and supervised the project. All authors contributed to interpretation and reviewed the manuscript.

Corresponding author

Correspondence to Kerstin Dück.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dück, K., Wüllhorst, R., Overmeyer, R. et al. On the effects of impulsivity and compulsivity on neural correlates of model-based performance. Sci Rep 14, 21057 (2024). https://doi.org/10.1038/s41598-024-71692-w

Download citation

Received: 23 May 2024
Accepted: 30 August 2024
Published: 10 September 2024
DOI: https://doi.org/10.1038/s41598-024-71692-w
Springer Nature Limited

On the effects of impulsivity and compulsivity on neural correlates of model-based performance

Abstract

Similar content being viewed by others

Expectation of reward differentially modulates executive inhibition

Lateral prefrontal model-based signatures are reduced in healthy individuals with high trait impulsivity

Does cognitive control ability mediate the relationship between reward-related mechanisms, impulsivity, and maladaptive outcomes in adolescence and young adulthood?

Introduction

Results

Behavior

EEG data

First-level results: Task effects

Second-level results: Impulsivity and compulsivity effects

Discussion

Methods

Participants

Procedure and measures

Two-step task

Personality scales

Impulsivity

Compulsivity

Data acquisition and analysis

Behavioral data and computational modelling

First-stage choice data

Computational modelling

EEG recording and data reduction

EEG analyses

First-level analyses

Second-level analyses

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation