Introduction

Cognitive flexibility and motivated behavior are essential for survival, and their dysfunction plays an important role in the clinical manifestations of multiple neuropsychiatric and neurodegenerative disorders. The catecholamines norepinephrine and dopamine are important for regulating a range of processes including motivation, reinforcement learning, and movement [1,2,3,4], and abnormal catecholamine function is central to cognitive/motivational dysfunction observed in many neurological disorders, dementias, and psychiatric disorders [5,6,7]. Several studies have highlighted the importance of cortical catecholamines, cortico-striatal circuits, and striatal dopamine signaling in regulating cognitive flexibility and goal-directed behavior [8,9,10,11,12,13,14]; however, the contributions of cortical catecholamines to striatal dopamine regulation and cognitive flexibility are not clear.

Pharmacological inhibition of the norepinephrine transporter (NET) not only elevates cortical norepinephrine but also cortical dopamine, as expression of the dopamine transporter (DAT) is very low in the cortex [15,16,17], and NET serves as the primary reuptake site for cortical dopamine [18,19,20,21]. NET is a common pharmacological target for multiple psychiatric disorders, and drugs targeting NET have been well studied in the context of cognition, anxiety and depression [22,23,24]. These pharmacological agents have mixed pharmacology at monoamine transporters and/or multiple off-target effects [25,26,27,28], however, which renders it difficult to ascertain the true effects of NET-specific inhibition. Genetic deletion of NET via NET knockout (NET-KO) mice represents an alternative tool to study the specific effects of NET dysfunction, given that changes in levels of NET expression and NET genetic polymorphisms might bear associations with multiple psychiatric conditions [29,30,31,32,33]. Interestingly, NET-KO mice have not only the same alterations in cortical catecholamines as pharmacological NET inhibition, but also have lower striatal DA levels as measured by microdialysis [19], revealing these mice as a unique model for studying the effects of cortico-striatal catecholamine signaling and its effects on cognitive flexibility.

The individual contributions of cortical NE and DA to distinct aspects of cognitive flexibility are not clear. Although pharmacological blockade of NET (elevated DA and NE) enhances reversal learning [34,35,36] and attentional set shifting [37], some studies suggest that NE is important for attentional set shifting but not reversal learning [37,38,39,40,41]. In contrast, dopamine and its receptors are sufficient to regulate both set shifting and reversal learning [8, 9, 42,43,44]. Striatal dopamine regulates reversal learning as well [45, 46], but how cortical catecholamines regulate tonic striatal dopamine and modulate cognitive flexibility is unclear. Microdialysis studies show an inverse relationship between dopamine levels in the prefrontal cortex (PFC) and striatum [47, 48]. Moreover, dopamine depletion in the PFC via pharmacological lesion increases striatal dopamine levels and amphetamine-induced locomotor activity compared to non-lesioned animals [47, 49, 50]. A recent study from our group further showed that elevating PFC dopamine but not norepinephrine levels can reverse hyperlocomotion in DAT-KO mice, which have elevated striatal dopamine [21]. To further clarify the role of cortical catecholamine regulation of striatal dopamine and cognitive flexibility, we used genetic (NET-KO mice) and lesion (mPFC NE lesion) models to test the effect of altered cortical catecholamines on cognitive flexibility and phasic striatal dopamine dynamics.

Methods

Animals and drugs

All studies were conducted in accordance with NIH guidelines for animal care and use and with an approved animal protocol from the University of Florida Institutional Animal Care and Use Committee. The NET-KO [19] mice were obtained from Dr. Marc Caron (Duke University). These knockout mice and littermate controls were backcrossed on to a C57BL6/J background for at least 10 generations and maintained on this background. Both male and female mice (3–5 months at testing) were used for experiments and housed in same-sex groups of 2–5 mice/cage, with ad libitum food (chow) and water except as noted below. In experiments that involved food restriction, mice were individually housed and gradually reduced to 85% of their free-feeding weight and maintained at this weight for the duration of these experiments. 6-hydroxydopamine (6-OHDA) was purchased from Sigma and dissolved in saline containing 0.02% ascorbic acid.

Probabilistic reversal learning task (PRL)

The PRL task was performed as described previously [13]. Briefly, the task was conducted in 8 identical operant chambers (15 × 13 × 13 cm, Med Associates) in individual sound-attenuating cubicles with a house light, two illuminated nosepoke holes on the chamber wall opposite the houselight, and a pellet (20 mg, Bio-Serv, Catalog# F0071) delivery trough between the two nosepoke holes. Prior to PRL testing, mice underwent several stages of shaping, including magazine training and nosepoke shaping, first on a FR1 schedule, and then on a FR1 schedule with a 50% probability of food delivery. In the PRL task itself, mice learned to discriminate between the two nosepoke holes that were 80% or 20% reinforced by food reward. On each 15 s trial, the houselight and both nosepoke holes were illuminated, and a response at either hole triggered the respective contingency. After a nosepoke response was recorded, lights turned off for an intertrial interval that lasted until the next trial. Failure to respond within 10 s was counted as an omission. Each trial lasted 15 s in duration, including the intertrial interval.

Once mice responded on 8 consecutive correct trials at the correct nosepoke (excluding omissions), the probabilities of food delivery at the two nosepokes were reversed. Each session was 50 min in duration and contained 200 trials. The primary measure of interest in the PRL task was the number of reversals completed per session. Additional measures in each session included the number of omitted trials, total number of correct responses, total number of errors, total number of inactive nosepokes (i.e., nosepokes during intertrial intervals when the house light was extinguished), latencies to initiate a nosepoke following trial onset, choice strategy (win-stay or lose-shift) [13, 51], errors to initial discrimination, errors to first reversal, trials to first reversal, and perseverative errors.

Surgical procedures

For NE lesions, mice were bilaterally injected with 200 nl of 3 ug/ul 6-hydroxydopamine (6-OHDA) in the dorsomedial prelimbic PFC (AP + 2.4 ML ± 0.6 DV −1.8 from Bregma). Two-three weeks later, the mice underwent training and testing in the PRL task as described above.

For fiber photometry, mice were unilaterally injected with 200 nl of AAV5-CAG-dLight 1.3b (Addgene) in the dorsomedial striatum (DMS) (AP + 1.0 ML ± 1.5 DV −3.0 from Bregma). A fiber optic cannula (400 μm, 0.66NA, Doric Lenses) was inserted into the brain 200 μm above the dLight injection site in the DMS and secured using dental cement with a bone anchor screw. Two mice from each genotype were also co-injected with AAV8-hSyn-mCherry-FlpO (Addgene) in the DMS as an expression control. Two-three weeks later the mice underwent training and testing for the PRL task as described above.

Fiber photometry

Dopamine transients were measured using a RZ10X TDT fiber photometry LED processor (Tucker Davis Technologies), which delivered 465 nm and 405 nm (isosbestic, control for motion artifacts) excitation light through a low autofluorescence optical fiber patch cord connected through a bronze sleeve to the metal ferrule of the fiber optic cannula implanted in the DMS. Excitation and emission signals were separated using a 6-port fluorescence minicube (FMC, Doric Lenses). TTLs from Med Associates hardware were transferred through a superport card connected to the RZ10X digital I/O port. Photometry signals were analyzed using pMAT software from the Barker lab [52].

Progressive ratio task (PRT)

A progressive ratio task was used to assess motivation to work for food as described previously [53, 54]. Briefly, mice underwent similar shaping in the same Med Associates operant chambers (magazine training, FR1) used in the PRL task but also underwent FR3 and FR5 training to acclimatize them to exerting increased effort to earn food. The mice were then tested in the progressive ratio task for 11 days, using the formula x = (5*e^reward*0.24))−5, in which x = 2, 4, 6, 9, 12, 15, 20, 25, 32, 40, 50, 62, 77, 95,118, 145, 178, 219, 268, 328…..901 nosepokes required to earn a single food pellet in each daily session. Failure to earn a food pellet within 20 min terminated the session and the previous ratio achieved was considered the “breakpoint”. This breakpoint was the primary measure of interest in the task, but the total number of nosepokes and the total number of food pellets earned were also assessed.

Monoamine tissue level - HPLC analysis

After behavioral testing, 6-OHDA- and vehicle-injected mice were sacrificed, brains isolated, and the mPFC was dissected to assess monoamine content. Monoamine levels were determined by HPLC analysis with electrochemical detection (EICOM) as described previously [21].

Nestlet shredding assay

The nestlet shredding assay was performed as described previously [55]. Mice were placed in fresh cages with a single nestlet (square-shaped compressed cotton) that was weighed prior to testing. Mice were left in the cages for 4 h, and the percent weight of unshredded nestlet remaining was calculated each hour. Data are represented as percent nestlet shredded (inverse of the percent nestlet remaining).

Locomotor activity

Locomotor activity was measured in a Versamax activity monitor (16 × 16 in, Omnitech Electronics, Columbus, OH) as described previously [56]. Mice were placed in the activity monitor, and activity (total distance traveled) was measured in 5 min bins for a total of 90 min.

Statistical analyses

Statistical analyses were performed using Graphpad Prism 10.0. Group sizes were based on our previously published studies, as well as on power analyses suggesting that they would yield power to detect significant (alpha = 0.05) effects of 0.8 and above. Data were analyzed using unpaired t-test, two-way ANOVA, or mixed-effects model with repeated measures (behavior), for comparison between genotypes, day, injection pairs, or lesion and monoamine content as described in figure legends. Individual factors were compared using post hoc Tukey tests described in figure legends. Sex was included as a variable in all analyses, but as no main effects or interactions involving sex were significant, they are not reported further. Data are presented as means ± SEM. No data points were excluded. Values of p < 0.05 were considered significant.

Results

As NET-KO mice have elevated cortical catecholamines and lower striatal dopamine [19], we reasoned that these mice should also show altered performance on cognitive tasks mediated by cortico-striatal catecholamine signaling. To address this question, we used a cognitive flexibility task (probabilistic reversal learning) that assesses the ability of mice to rapidly switch between response contingencies to earn food rewards [57]. In this task, analysis of number of reversals per session in WT and NET-KO mice revealed significant main effects of genotype and days but no interaction effects were observed during the 10 days of testing (Fig. 1A, Mixed-effects model Repeated measures, Genotype, F (1, 22) = 8.292, P = 0.0087; Day, F (4.986, 109.7) = 2.604, P = 0.0290; Day × Genotype, F (7, 154) = 0.9720, P = 0.4537). The NET-KO mice also made fewer trial omissions than WT mice (i.e., they completed a greater number of trials per session; unpaired t-test, WT vs NET-KO, Two-tailed, t = 5.089, df = 14, P = 0.0002, Fig. 1B). This greater number of completed trials did not account for the greater number of reversals, however, as normalization of the number of reversals by the number of completed trials per session (percent reversals per 200 trials) in WT and NET-KO mice still revealed significant main effects of genotype and days (Fig. 1C, Mixed-effects model Repeated measures, Genotype F (1, 21) = 9.436, p = 0.0058; Days, F (5.722, 115.2) = 3.042, p = 0.0095; Days × Genotype F (7, 141) = 1.010, p = 0.4268). Total reversals averaged over all 10 days also revealed significant genotype effects (unpaired t-test, Two-tailed, t = 2.883, df = 21, p = 0.0089, WT vs NET-KO, Fig. 1D). NET-KO mice had a greater number of nosepokes during intertrial intervals when the nosepokes were inactive (inactive nosepokes; unpaired t-test, WT vs NET-KO, Two-tailed, t = 2.609, df = 14, P = 0.0206, Fig. 1E), but no differences were observed when inactive nosepokes were normalized to total no. of nosepokes (Fig. 1F). Although there was a trend for the NET-KO mice to be lower than WT mice, no significant main effects were observed in the number of errors to criterion on the initial discrimination (Fig. 1G) or trials to first reversal (Fig. 1H) in each daily session. However, a significant main effect of days for errors to criterion after first reversal (Fig. 1I, days, p = 0.0017, F (3.484, 68.91) = 5.198) was observed with no genotype differences. Additionally, NET-KO mice made more active correct responses (correct) on the 80% probability nosepoke, earned more food pellets, and had fewer perseverative errors (day 1) compared to WT mice but did not differ in the number of responses at the 20% probability nosepoke (incorrect) or in latencies to respond (Supplementary Fig. S1) averaged over each day of PRL. Furthermore, analysis of choice strategies revealed greater win-stay and less lose-shift behavior in NET-KO mice compared to WT controls (Supplementary Fig. S2), suggesting that NET-KO mice are more sensitive to both positive and negative feedback, which presumably optimizes their subsequent choices to enhance reversal learning.

Fig. 1: NET-KO mice have enhanced reversal learning in a Probabilistic Reversal Learning (PRL) task.
figure 1

NET-KO mice have A significantly greater numbers of reversals per session, *p < 0.05 Mixed-effects model main genotype effect and B fewer trial omissions for each genotype averaged over each session for 10 days, ***p < 0.001 t-test, WT vs NET-KO, C greater number of reversals normalized by number of completed trials, from Day 1 through Day 10. **p < 0.01 Mixed-effects model main genotype effect, NET WT vs KO. D More total reversals averaged over each session for 10 days **p < 0.01 t-test, WT vs NET-KO, E more inactive nosepokes averaged over each session for 10 days, *p < 0.05 t-test, NET WT vs KO but F similar normalized inactive nosepokes averaged over each session for 10 days, and similar G errors to criterion initial discrimination, H Trials to 1st reversal and I errors to criterion after 1st reversal, compared to WT controls. n = 11–12 mice per genotype (6/7 WT and 6/4 NET-KO male/female mice).

Microdialysis studies show that genetic or pharmacological disruption of NET causes elevated cortical catecholamines but lower baseline levels of striatal dopamine [19, 58, 59]. It is well known that striatal dopamine dynamics play an important role in reinforcement learning. Specifically, phasic striatal dopamine signals encode reward prediction error (RPE)-like signals that inform future choices [1, 60]. We asked whether changes in cortical catecholamines and baseline striatal dopamine play a role in altered phasic striatal dopamine dynamics, which could explain the enhanced cognitive flexibility of the NET-KO mice. We used dLight 1.3b and fiber photometry to measure dopamine dynamics in the dorsomedial striatum in WT and NET-KO mice while performing the PRL task. We focused on dLight measurements in the DMS as the PRL task is an instrumental learning task that relates to action-outcome learning, which is predominantly mediated by the DMS [61,62,63]. We measured phasic dopamine transients at early (day 1) and late (day 10) phases of the PRL task, at specific peri-event timepoints such as cue light, nosepoke, reward delivery, and reward omission (dashed line in figure denotes onset of respective event). We achieved consistent targeting of dLight 1.3b AAV and the optic fiber in the DMS (Fig. 2A) of WT and NET-KO mice. In mice co-injected with an mCherry AAV to control for expression between genotypes, we found that expression of mCherry was similar between genotypes. We were able to consistently measure stable dopamine signals throughout the 50 min behavioral sessions in both genotypes (Fig. 2B). Similar to Fig. 1, we observed genotype differences in reversal learning in mice used for photometry measurements (Supplementary Fig. S3a). During early PRL task training, at the high probability (80%) nosepoke, both WT and NET-KO mice showed an RPE-like pattern of dopamine signals, but NET-KO mice had significantly larger phasic dopamine responses during reward delivery (Fig. 2C, 80% nosepoke reward, i-peri-event histogram (PETH), ii-AUC and iii-peak response), reward omission (Fig. 2D, PETH, 80% nosepoke no reward, i-peri-event histogram (PETH), ii-AUC and iii-peak response), and during cue presentation (Supplementary Fig. S3b). Phasic dopamine responses at the low probability (20%) nosepoke were also greater in NET-KO compared to WT controls during reward delivery (Fig. 2E, 20% nosepoke reward, i-peri-event histogram (PETH), ii-AUC and iii-peak response) and omission (Fig. 2F, 20% nosepoke no reward, i-peri-event histogram (PETH), ii-AUC and iii-peak response). In contrast, during late phases of the PRL task, all phasic dopamine responses were similar in WT and NET-KO mice (Fig. 2G–J). Analysis of AUC and peak values of phasic dopamine responses for daily sessions from day 1 to 10 revealed significant genotype and interaction effects during reward delivery but not reward omission, particularly during early training (Supplementary Fig. S3c–f). These results are consistent with the enhanced initial learning shown by the NET-KO mice in the PRL task.

Fig. 2: NET-KO mice have enhanced phasic dopamine transients in the Probabilistic Reversal Learning (PRL) task.
figure 2

A Fiber photometry setup and representative images of optical probe implant site and dLight and mCherry AAV expression in the DMS of WT and NET-KO mice. B Dopamine measurements using dLight 1.3b in the 465 nm channel over the course of an entire daily PRL session of 50 min. CJ i -peri-event time histograms (PETH), ii - AUC ΔF/F values (peri-event: 0–2.5 s) and iii – Peak ΔF/F responses (peri-event: 0–2.5 s) of phasic dopamine transients in WT and NET-KO mice for different reward contingencies as mentioned in the figure, at early (CF) or late (G–J) phases of PRL task. Dashed line denotes onset of reward delivery or omission. *p < 0.05, **p < 0.01 unpaired t-test, WT vs NET-KO; ns not significant. n = 5 mice per genotype (4 male/1 female, WT and NET-KO mice).

In addition to elevated norepinephrine levels, NET-KO mice also have elevated dopamine levels in the prefrontal cortex [18]. To address the specific roles played by cortical norepinephrine and dopamine in reversal learning, we used the toxin 6-OHDA to lesion norepinephrine terminals in the mPFC (prelimbic) of WT mice, followed by PRL testing. Within the PFC the prelimbic and OFC regions have apparently opposing roles in reversal learning performance [13], such that inactivation of the prelimbic region enhances cognitive flexibility. As shown in Fig. 3A, reversal learning in 6-OHDA-injected mice was not different from vehicle-injected controls, and the two groups had similar numbers of trial omissions (Fig. 3B) and inactive nosepoke responses (Fig. 3C). To confirm depletion of norepinephrine in the PFC, we isolated tissue from lesioned mice after behavioral testing and measured catecholamine levels in the PFC using HPLC. As expected, we achieved selective depletion of NE terminals, as expression of DAT in the PFC is very low (Fig. 3D, Two-Way ANOVA, lesion x monoamine F (2, 26) = 19.34 p < 0.0001). These data suggest that mPFC NE and DA play distinct roles in reinforcement learning and cognitive flexibility.

Fig. 3: Probabilistic Reversal Learning is intact in NE lesioned mice.
figure 3

Compared to vehicle (Veh) controls, NE lesioned mice (6-OHDA) showed: A similar numbers of reversals per session from Day 1 through Day 8. p = 0.77 Mixed-effects model Repeated measures, Veh vs 6-OHDA; B similar numbers of trial omissions averaged over each session for 8 days, p = 0.73, t-test, Veh vs 6-OHDA. C similar numbers of inactive nosepokes averaged over each session for 8 days p = 0.82, t-test, Veh vs 6-OHDA. and D lower levels of NE but not DA or 5-HT as measured by HPLC analysis of PFC tissue. **p < 0.01 Two-Way ANOVA, Veh vs 6-OHDA. n = 6 (3 male and 3 females) mice for each treatment group.

To decipher whether NE plays a role in behaviors other than probabilistic reversal learning, we tested NET-KO and mPFC NE-lesioned mice in simple stress- and anxiety-related behavioral assays [64, 65]. Depletion of norepinephrine renders mice more susceptible to stress and anxiety, whereas enhancing NE levels can mimic antidepressant-like effects [66, 67]. The nestlet shredding task, conducted in a novel environment, is useful for measuring stress-induced repetitive, compulsive behavior during physiological stress or anxiogenic situations - i.e., nest building may be a means to actively cope with anxiogenic stimuli [68]. As seen in Fig. 4, NET-KO mice shredded nestlets significantly more than WT controls in the 4-hour testing period (Mixed-effects model Repeated measures, Time x Genotype, F (4, 56) = 4.497, P = 0.0032). In contrast, PFC NE-lesioned mice shredded the nestlets significantly less than vehicle controls (Fig. 4, Mixed-effects model Repeated measures, Time x Lesion, F (4, 40) = 5.165, P = 0.0019). Additionally, as observed previously [19], NET-KO mice showed significantly less novelty-induced activity in an open field compared to WT controls (Fig. 4C, Mixed-effects model Repeated measures, Time x Genotype, F (17, 221) = 2.657, P = 0.0006) and a shorter cumulative total distance traveled (Fig. 4D, unpaired t-test, NET WT vs KO, Two-tailed, t = 3.191, df = 13, P = 0.0071). Thus, NE bidirectionally modulates nestlet shredding, again highlighting distinct roles for cortical DA and NE in specific behaviors.

Fig. 4: Nestlet shredding and open-field activity in NET-KO mice.
figure 4

A NET-KO and WT control, n = 8 per genotype or B PFC NE-lesioned or vehicle (Veh) control, n = 8 per group mice were placed in fresh cages with nestlets, and weights of nestlets were measured every hour for 4 h. Data are presented as percent of the nestlet shredded. *P < 0.05 by Mixed-effects model Repeated measures, post hoc Tukey’s test, comparing NET WT and KO (4 male and 4 female mice for KO and 5 male and 3 females for WT mice) or Veh control and PFC NE lesion (4 male and 4 female mice for WT and KO mice). C, D Mice were placed in an open field chamber and distance traveled (locomotor activity) was measured for 90 min. n = 7–8 per genotype **P < 0.01 by t-test comparing NET WT and KO (4/5 male and 3 female mice for KO/WT mice).

The heightened cognitive flexibility and fewer trial omissions observed in NET-KO mice in the PRL task could imply that these mice are more motivated to work for food. To address this possibility, a separate group of mice was trained to respond for food on a progressive ratio schedule of reinforcement, in which each successive reinforcer earned increases the number of responses required to earn the subsequent reinforcer, until responding ceases altogether (i.e., breakpoint) [53, 69]. Surprisingly, there was no difference in the breakpoint between WT and NET-KO mice (Supplementary Fig. S4A), nor were there differences in the total number of responses or food pellets earned (Supplementary Fig. S4B, C), suggesting equivalent levels of food motivation.

Considered together, these results suggest distinct roles for cortical norepinephrine and dopamine in cognitive flexibility and motivated behavior. Furthermore, in NET-KO mice task performance during reversal learning parallels the magnitude of phasic dopamine transients in the DMS, even though tonic dopamine levels are low.

Discussion

In the current study we show that mice with genetic deletion of NET display enhanced cognitive flexibility as measured by reversal learning, with a simultaneous increase in the magnitude of phasic dopamine transients in the DMS. Furthermore, we show that PFC NE levels are crucial for nestlet shredding but not reversal learning. Cognitive flexibility promotes survival in the natural world by allowing adaptive decision making in an ever-changing environment. Cognitive flexibility also allows switching of responses during cue-response-outcome behavior to enable formation of new associations [45, 70]. Deficits in switching or behavioral flexibility can lead to adverse outcomes, some of which are apparent in conditions such as ADHD, schizophrenia, OCD, and substance use disorders. In ADHD in particular, lack of cognitive flexibility can be in part driven by hyperactivity and/or impulsivity that is reversed by pharmacological treatment with monoamine transporter blockers [71]. One of the mechanisms thought to be involved in reversing ADHD phenotypes is the targeting of norepinephrine signaling via NET or alpha-adrenergic receptors in the PFC [72]. It is not clear, however, if the consequences of pharmacological targeting of NET are due to cortical NE or DA and how they affect striatal dopamine dynamics and cognitive flexibility. To test whether NET blockade can alter cognitive flexibility, we evaluated NET-KO mice in a probabilistic reversal learning task. NET-KO mice made more reversals per session compared to WT controls, suggestive of enhanced cognitive flexibility. NET-KO mice also had significantly fewer trial omissions, but normalization of reversals by the number of completed trials produced results comparable to those with completed reversals alone though, indicating that the reduction in omissions does not account for apparently greater flexibility in NET-KO mice.

NET deletion not only elevates NE but also cortical DA, as NET is the primary site of DA reuptake in the cortex. Selective dorsal PFC NE depletion that maintained PFC DA levels did not affect any measure of reversal learning task performance, consistent with a previous study [38] and suggesting that PFC DA and NE differentially regulate cognitive flexibility. PFC DA and its receptors within both dorsal and ventral regions of the PFC have been shown to be sufficient to regulate cognitive flexibility [8, 9, 73]. In contrast, the specific contributions of NE to reversal learning are less clear, as some studies suggest that NE in the OFC might play a role in action-outcome updating in an outcome devaluation reversal task [40, 41]. PFC NE might also play a role in regulating behavior under stressful or aversive conditions [65]. The nestlet shredding test is often used as a measure of repetitive or compulsive behavior under anxiogenic or stressful situations - i.e., “shred the nestlet to create a safe harbor in an anxiety-inducing novel cage environment” [55]. Moreover, regions of the PFC including the anterior cingulate cortex (ACC) [74, 75] and the OFC [11] have been implicated in compulsive behaviors. Compared to WT mice, the NET-KO mice (which have elevated NE and DA levels in PFC) shredded more nestlet across the 4-hour test, consistent with the idea that these mice perform actions faster to avoid aversive stimuli, which could be on the compulsive behavior spectrum. Conversely, dorsal PFC (ACC and prelimbic) NE depletion (which did not affect DA levels) caused a reduction in nestlet shredding, which is consistent with prior work showing that DBH-KO mice (which have reduced NE but elevated DA levels in PFC) have reduced nestlet shredding [55]. Interestingly, NET KO mice have greater number of inactive nosepokes in the PRL task (noksepokes when the houselight was off and responses had no programmed consequences), which could suggest a compulsive-like behavioral phenotype (although the opposite effect was not evident in the PFC NE-depleted mice). Together, the results from the nestlet shredding task suggest that PFC NE, but not DA, can bidirectionally modulate expression of repetitive/compulsive behaviors, particularly in novel or anxiety-inducing contexts.

In the PRL task, we found larger phasic dopamine transients in the DMS of NET-KO mice in early phases of the task, even though NET-KO mice have lower baseline striatal dopamine levels as shown previously by microdialysis [19], which could explain why they complete more reversals early in the task compared to WT controls. Larger phasic dopamine transients have been thought to translate to stronger reinforcement signals and therefore stronger associations between cues or actions and outcomes [1, 76, 77]. One hypothesis is that NET-KO mice learn at a faster rate than WT mice because they have lower baseline dopamine levels, which allows for larger phasic dopamine bursts relative to baseline—i.e. a higher signal-to-noise ratio [78]. Evidence for this notion comes from a study showing that NET-KO mice have elevated DAT protein but not mRNA expression in the midbrain [79], which could lead to enhanced reuptake and lower extracellular DA, but greater vesicular stores of DA. The elevated vesicular stores of DA would then cause elevated release of DA during phasic activity bursts. The upregulation of only DAT protein but not mRNA levels in the NET KO mice suggests that this is potentially not a compensatory developmental effect but rather a network compensatory effect, since changes in surface DAT protein levels can be induced in an activity-dependent manner - i.e., via depolarization [80]. Mice learn action-outcome associations in the PRL task not only from rewarded trials (positive reinforcement/WinStay) but also from non-rewarded (reward omission/LoseShift) trials. Interestingly, both positive and negative dopamine transients during reward delivery and omission, respectively, are sharper in NET-KO mice in early stages of PRL training, suggesting that NET-KO mice have access to more salient information from both trial types, consistent with their Winstay/Loseshift behavior. However, in WT mice negative dopamine transients are not as sharp as the positive transients, suggesting that WT mice are updating information predominantly from positive reinforcement signals. Intriguingly, in the late phases of PRL training in both NET-KO and WT mice, positive and negative dopamine transients during reward delivery and omission, respectively, were more consistent and sharper, suggesting that unlike early phases, in these later stages of learning WT mice use both positive reinforcement and reward omission signals equally to inform future outcomes.

The larger phasic dopamine signals and enhanced cognitive flexibility could imply that NET-KO mice might have a greater motivational drive to perform tasks. Surprisingly, NET-KO mice show breakpoints similar to WT mice in a progressive ratio task, suggesting that they are not more motivated than WT mice to work for food rewards. Some studies suggest that elevated dopamine levels correlate with enhanced motivation in the progressive ratio task [81, 82], and thus one might expect no change or even reduced food motivation in NET-KO mice, which have lower tonic dopamine levels. One potential explanation for this disconnect between motivation and cognitive flexibility is that the NET-KO mice are more attentive or have greater task engagement (fewer trial omissions) but are not necessarily more motivated. A recent study suggests that greater DMS dopamine terminal phasic activity (as measured by a GCaMP sensor) correlates with punishment-resistant, goal-directed behavior [83]. It remains to be determined if the enhanced DMS phasic dopamine signals in NET-KO mice are associated with greater punishment-resistant, goal-directed behavior.

Although the NET-KO mice provide a unique model in which to study cognitive flexibility, there are possible compensatory developmental mechanisms to consider, owing to chronic changes in catecholamine levels. Developmental changes alone likely do not account for the observed behavioral/neurochemical alterations in these mice, however, as chronic administration of NET blockers induces behavioral changes similar to those observed in the NET-KO mice [19]. In addition, it is important to acknowledge that NET is widely expressed not only in the cortex but also in other regions including the bed nucleus of the stria terminalis, medial shell of nucleus accumbens, midbrain, and hippocampus. Thus, it is possible (and even likely) that the behavioral phenotypes in NET-KO mice result from effects not only in the cortex but also in these other regions. Although we partially addressed this caveat with the specific lesions of mPFC NE terminals, future studies using region-specific NET, PFC NE, or PFC DA manipulations will be important for highlighting the specific role(s) of NET and catecholamines in these other brain regions.

Blockade of NET has been a major pharmacological approach for treating multiple disorders, and it is hypothesized that increased cortical NE is a predominant mechanism of therapeutic action. In our previous work we showed that systemic administration of nepicastat (an inhibitor of the dopamine beta hydroxylase enzyme that converts dopamine to norepinephrine) caused reduced NE but elevated DA levels in PFC, and reduced the hyperactivity phenotype in DAT-KO mice [21]. Thus, the combination of depleted NE and elevated DA in PFC might be a useful pharmacological approach to reduce impulsivity and hyperactivity while maintaining or even enhancing cognitive flexibility. In combination with the current findings, our studies show distinct roles for cortical NE and DA in various aspects of cognitive flexibility. Thus, targeting cortical NE and/or DA will likely have distinct therapeutic actions and should be considered when designing new therapeutic approaches.