Introduction

Occasional cocaine use can progress to uncontrolled abuse and dependence in some individuals, resulting in health, interpersonal, and occupational challenges that interfere with healthy daily functioning. In addition to neuronal mechanisms, neuro-immune signaling may contribute to cocaine use disorder (CUD) (Hill et al. 2010; Valdizán et al. 2012; Theberge et al. 2013; Korpi et al. 2017) (Hutchinson et al. 2008; Hutchinson and Watkins 2014; Lacagnina et al. 2017; Bachtell et al. 2017; Hofford et al. 2019). The evidence for altered immune function in CUD has been indicated in clinical populations of cocaine users. For instance, individuals diagnosed with CUD have increased peripheral levels of the proinflammatory cytokine, interleukin-6, and decreased levels of the anti-inflammatory cytokine, interleukin-10 (Moreira et al. 2016). Treatment-seeking cocaine-dependent individuals also exhibited increased acute levels of the pro-inflammatory cytokine, tumor necrosis factor-alpha, in response to stress-related or drug-associated cue imagery (Fox et al. 2012). CUD symptom severity was also associated with greater alterations in peripheral proinflammatory markers, indicating a correlative relationship between behavioral symptoms of CUD and immune activation (Araos et al. 2015).

The significance of these inflammatory changes and their impact on brain function are not fully understood, but immune-related signaling molecules are known to impact neurotrophic and neuromodulatory functions (Pedraz et al. 2015; Araos et al. 2015; Moreira et al. 2016; Levandowski et al. 2016; García-Marchena et al. 2019; Soder et al. 2020; Stamatovich et al. 2021). Anti-inflammatory drugs, such as minocycline and ibudilast, alter drug-related behaviors and relevant neural activity in pre-clinical addiction models, and clinical studies have found some promising, but limited, efficacy with these drugs (Liu et al. 2006; Chen et al. 2009; Beardsley et al. 2010; Fujita et al. 2012; Attarzadeh-Yazdi et al. 2014; Bell et al. 2015; Arezoomandan and Haghparast 2016; Worley et al. 2016; Cooper et al. 2016, 2017; Metz et al. 2017; Birath et al. 2017; Arezoomandan et al. 2018; Li et al. 2020).

The Toll-like receptor 4 (TLR4) system, an innate immune receptor complex expressed primarily by microglia in the brain, is involved in psychostimulant- and opioid-induced immune activation (Jacobsen et al. 2014; Bachtell et al. 2017; Wu and Li 2020). TLR4 signaling initiates an acute inflammatory cytokine response by recognizing molecular patterns associated with pathogens, such as lipopolysaccharide (LPS) on the cell walls of gram-negative bacteria (Kawai and Akira 2007). TLR4 signaling is also activated by molecular markers of cellular damage (e.g., high mobility group box 1 (HMGB1)) and some exogenous molecules exogenous or xenobiotics (Northcutt et al. 2015). Single injections of cocaine or multiple days of cocaine self-administration initiate transcriptional expression of inflammation-related genes in regions of the mesolimbic dopamine (DA) system that is well-characterized for its involvement in drug reinforcement and relapse (Northcutt et al. 2015; Brown et al. 2018; Burkovetskaya et al. 2020). Multiple lines of evidence suggest that TLR4 is involved in cocaine-induced inflammatory signaling in the brain (Northcutt et al. 2015; Wang et al. 2019). Importantly, pharmacological manipulation of TLR4, both systemically and within the ventral tegmental area (VTA), alters nucleus accumbens (NAc) DA release in response to psychostimulants and opioids (Northcutt et al. 2015; Tanda et al. 2016; Wang et al. 2019).

The involvement of TLR4 in cocaine reinforcement and seeking models has been explored with TLR4-specific agonists and antagonists. Notably, (+)-naloxone and (+)-naltrexone are stereo-specific, blood-brain barrier permeable (+)-enantiomers of opioid receptor antagonists that are antagonists for TLR2 and TLR4 (Hutchinson et al. 2008; Northcutt et al. 2015; Zhang et al. 2018; Wang et al. 2019). Unlike the (-)-enantiomers, the (+)-enantiomers are devoid of opioid receptor activity making them interesting compounds for exploring TLR2/4-induced effects. These drugs block TLR4 via competitive binding at the myeloid differentiation factor 2 (MD2) LPS pocket of the TLR4 complex (Zhang et al. 2018). Additionally, LPS from the bacterial species Rhodobacter sphaeroides (LPS-Rs) has been well-characterized as a TLR4 antagonist, although it is not blood-brain barrier permeable and must be administered intracranially when targeting specific brain regions. In accordance with the hypothesis that cocaine induces neuroimmune signaling through TLR4, TLR4 antagonists, such as (+)-naltrexone, (+)-naloxone, and LPS-Rs, block pro-inflammatory neuro-immune transcriptional expression to cocaine in isolated microglia preparations and in vivo within the brain parenchyma (Northcutt et al. 2015). In addition to blocking pro-inflammatory cytokine expression, (+)-naltrexone and (+)-naloxone reduce both operant responding for cocaine and expression of conditioned preference for psychostimulants in some experimental conditions (Northcutt et al. 2015; Tanda et al. 2016; Wang et al. 2019). In addition to (+)-naltrexone and (+)-naloxone, intracranial administration of LPS-Rs into the VTA reduced the reinstatement of operant responding for cocaine but not sucrose (Brown et al. 2018). Together, these studies point to a role of the TLR4 system in cocaine reinforcement and drug-seeking; however, some questions remain regarding the efficacy of these drugs in models of motivated drug-taking and drug-seeking (Tanda et al. 2016; Yue et al. 2019).

To further understand TLR4 behavioral pharmacology in the context of CUD, we assessed the impact of TLR4 antagonists in cocaine self-administration and cocaine-seeking models. We first evaluated the continuous administration of (+)-naltrexone during cocaine self-administration to evaluate the impact of TLR4 antagonism on (1) acquisition and maintenance of cocaine self-administration, (2) cocaine reinforcement using a progressive ratio (PR) schedule of reinforcement, and 3) cocaine seeking during forced abstinence and following extinction training.

Materials and methods

Animals

A total of 188 male Sprague-Dawley rats (Envigo, Indianapolis, IN), weighing between 330 and 350 g upon arrival, were singly housed in a temperature (72 °F) and humidity (40%) controlled animal vivarium on a standard 12-h light/ 12 h dark cycle. Rats were provided standard rat chow (Envigo, Indianapolis, IN) and water ad libitum throughout the entirety of the study unless otherwise noted. All experiments were performed during the light cycle. All procedures were conducted per institutional guidelines outlined by the Institutional Animal Care and Use Committee at the University of Colorado Boulder, an Association for Assessment and Accreditation of Laboratory Animal Care International accredited institution.

Drugs

Cocaine HCl (Sigma-Aldrich, St. Louis, MO) was dissolved in sterile saline (0.9% NaCl) at a working concentration of 5 mg/mL and filtered daily through a 200-nm filter before self-administration. (+)-Naltrexone was synthesized and provided by Dr. Kenner Rice of the Drug Design and Synthesis Section, National Institute on Drug Abuse and National Institute on Alcohol Abuse and Alcoholism, National Institutes of Health. (+)-Naltrexone was synthesized as (+)-naltrexone.HCl.4H2O with added waters of hydration to improve solubility. All doses are reported as (+)-naltrexone free base. Selection of doses and administration methods is based on prior behavioral work assessing similar outcome variables and work evaluating the pharmacokinetics of (+)-naltrexone (Theberge et al. 2013; Northcutt et al. 2015; Brown et al. 2018). Northcutt et al. (2015) reported that administration of 10 mg/kg (+)-naltrexone subcutaneously was absorbed very rapidly (tmax = 0.25 h), and the half-life of (+)-naltrexone was 1.19 ± 0.063 h. Administration of 10-mg/kg (−)-naltrexone (SC) showed a similar pharmacokinetic profile with (+)-naltrexone suggesting that absorption and distribution characteristics are similar between the two enantiomers. Thus, subcutaneous mini-pump dosing was based on prior work using osmotic mini-pump delivery of either (−)-naltrexone or (+)-naltrexone (Hill et al. 2010; Valdizán et al. 2012; Theberge et al. 2013; Korpi et al. 2017). Surgically implanted subcutaneous osmotic mini-pumps (ALZET, Cupertino, CA) delivered 15 mg/kg/day (+)-naltrexone for up to 14 days as reported previously (Theberge et al. 2013). Acute administration of (+)-naltrexone was administered as two half-doses (up to 60 mg/kg in total) 30 min apart to improve pharmacokinetic absorption and distribution throughout the testing session. The lipophilic nature of (+)-naltrexone makes it ideal for distribution following a systemic administration, but not a localized brain infusion. Therefore, the classical TLR4 antagonist, Lipopolysaccharide from the species Rhodobacter sphaeroides (LPS-RS), was selected given its well-characterized specificity for the TLR4 complex and comparative lack of diffusion. LPS-Rs (Sigma-Aldrich, St. Louis, MO) was administered intracranially at a dose of 5 μg/side based on prior work (Northcutt et al. 2015; Brown et al. 2018; Wang et al. 2019). The non-steroidal anti-inflammatory drug Ketofen (3 mg/kg) and the antibiotic Baytril (3 mg/kg) were dissolved in sterile saline (0.9% vol./wt.) and administered post-operatively immediately following surgery.

Surgical catheter implantation and cocaine self-administration procedures

Silastic tubing catheters were intravenously implanted into the right jugular veins of rats to allow for intravenous (iv) self-administration of cocaine HCl (0.5 mg/kg/inf., Sigma-Aldrich, St. Louis, MO). Following 5–7 days of recovery from surgery, rats began self-administration procedures in sound-attenuating operant chambers (Med-Associates, Georgia, VT) equipped with two retractable levers, a fan, a house light, and discrete cue lights located directly above the retractable levers. Commencement of self-administration sessions was occasioned by the illumination of the house light and the presentation of two levers located on one side of the chamber. Intravenous infusions of 100-μL cocaine HCl (0.5 mg/kg/infusion) were contingent upon lever presses on the drug-paired lever. Presses on the inactive lever had no programmed consequences. Infusions of cocaine occurred simultaneously with 5-s illumination of a discrete cue light located directly above the active, drug-paired lever. A 20-s time-out period followed cocaine infusion and was occasioned by extinguishing of the house light. Cocaine self-administration procedures were performed using a fixed-ratio 1 (FR1), fixed-ratio 5 (FR5), and progressive ratio (PR) schedule of reinforcement. During extinction procedures, drug-paired lever presses no longer resulted in iv cocaine infusions, drug-paired cue light illumination, or extinguishing of the house light.

Intracranial pharmacological microinjections

Rats were stereotaxically implanted with bilateral 26 ga. guide cannulae (PlasticsOne, Roanoke, VA) extending 8 mm below the plastic pedestal. Guide cannulae were stereotaxically inserted into the NAc Shell (A/P: +1.8, M/L: +0.8, D/V: −5.6) or NAc Core (A/P: +1.7, M/L: +1.5, D/V: −5.7) using coordinates adapted from Paxinos and Watson (2013). Placements were anatomically verified post-mortem via visual inspection of 1-μl cresyl violet microinjections via the surgically implanted guide cannula. Stereotaxic surgeries were performed during the same surgery as the catheter implantation. Dental cement fixed to 4 skull screws was used to secure the guide cannula to the skull. Microinjections of LPS-Rs (5 μg/side, 1-μl vol.) were delivered via a bilateral microinjector that was inserted into the guide cannulae and extended 1 mm beyond the tip of the cannulae. Microinjections of LPS-Rs were delivered at a rate of 200 nL/min. Following completion of the injection, the microinjector remained in the cannula for an additional 2 min to allow for diffusion of the drug. During control microinjections, rodents received 1-μL injections of sterile H2O at a rate of 200 nL/min.

Experiment 1: Effects of TLR4 antagonism on the acquisition of cocaine self-administration and subsequent cocaine seeking

To test the effects of TLR4 antagonism on the acquisition of cocaine self-administration, rats were implanted with subcutaneous osmotic mini-pumps (ALZET, model 2ML2, Cupertino, CA) to allow continuous delivery of 15 mg/kg/day (+)-naltrexone or saline for 14 days during cocaine self-administration. One day after mini-pump implantation, sated rats were allowed to lever press for iv cocaine infusions on an FR1 reinforcement schedule in 2-h sessions. Mini-pumps were removed before the 14th self-administration session and rats resumed cocaine self-administration for a total of 17 self-administration sessions. To evaluate the latent effects of continuous (+)-naltrexone during cocaine self-administration on subsequent cocaine seeking, cocaine-paired lever responding was extinguished in seven daily 2-h extinction sessions. Reinstatement of cocaine seeking was conducted in three separate test sessions using a within-subjects design. The first reinstatement test evaluated cue-induced cocaine seeking in a 3-h session. The session included an initial 2-h period under extinction conditions that was immediately followed by a 1-h reinstatement test period. During the reinstatement test period, 5 non-contingent presentations of the cocaine-associated cue (5-s) were delivered every 2-min over the first 10-min of the session, and responses at the previously cocaine-paired lever also resulted in a 5-s illumination of the cocaine-associated cue. The second and third reinstatement tests were counterbalanced to evaluate cocaine (or saline)-primed reinstatement of cocaine seeking. The drug-primed reinstatement session occurred under extinction conditions. An injection of cocaine (15 mg/kg, ip) or sterile saline (0.9% vol./wt., ip) was administered immediately before the 2-h reinstatement session.

Experiment 2: Effects of TLR4 antagonism on cocaine’s reinforcing properties

The effects of TLR4 antagonism were evaluated on the motivation to self-administer cocaine using PR-maintained cocaine self-administration. Rats acquired cocaine self-administration on an FR1 schedule for five daily 2-h sessions similar to Exp. 1 before being transitioned to an FR5 schedule of reinforcement for an additional five 2-h self-administration sessions. Immediately following one self-administration session using a PR schedule of reinforcement, rats were implanted with subcutaneous osmotic mini-pumps (ALZET, model 2ML1, Cupertino, CA) to allow continuous delivery of 15 mg/kg/day (+)-naltrexone or saline for 7 days. After mini-pump implantation, rats resumed testing using a PR schedule of reinforcement for 4 additional sessions. The progression for response/injection ratios was determined according to [5e(injection number × 0.2)]−5 (e.g., 1, 2, 4, 6, 9, 12, 15, 20, 25, 32, 40, 50). In a separate cohort of rats, PR responding was evaluated following an acute pretreatment of (+)-naltrexone (0, 30, 60 mg/kg, ip) administered before the PR test session. The (+)-naltrexone pretreatment was administered as two half-dose injections (0, 15, 30 mg/kg, ip, respectively) administered 30 min apart to improve pharmacokinetic absorption and distribution. Rats were tested following vehicle and either 30 mg/kg or 60 mg/kg (+)-naltrexone with the order of the pretreatments counterbalanced to avoid ordering effects.

Experiment 3: Acute (+)-naltrexone before sucrose self-administration did not alter responding for sucrose

Prior work demonstrates that high doses of (+)-naltrexone inhibit the self-administration of food pellets (Tanda et al. 2016). Here, we assess whether (+)-naltrexone administered as 2 half-dose injections impacts the self-administration of a sucrose reward. Rats (n = 6) were placed on a limited diet with access to 15 g of standard rat chow per day and were trained to self-administer banana-flavored sucrose pellets (45 mg/pellet) on an FR1 schedule of reinforcement for 4 daily 2-h sessions. Following FR1 training, rats were advanced to an FR5 schedule of reinforcement for 4 daily additional 2-h sessions. Rats then underwent testing in four 2-h sessions where they received 2 injections of (+)-naltrexone (0, 7.5, 15, 30 mg/kg, ip) at 30-min intervals for a total of 0, 15, 30, or 60 mg/kg (+)-naltrexone. Immediately following the 2nd injection, rats were placed in operant chambers to self-administer sucrose pellets on an FR5 schedule during 2-h sessions. The order of the pretreatments counterbalanced to avoid ordering effects.

Experiment 4: Effect of TLR4 antagonism on cocaine seeking during abstinence

The effect of TLR4 antagonism was evaluated on the incubation of cue-seeking following cocaine self-administration. Rats acquired cocaine self-administration on an FR1 schedule for ten daily 2-h sessions similar to Exp. 1. During the 14-d forced abstinence period, (+)-naltrexone was chronically delivered via subcutaneous mini-pumps (ALZET, model 2ML2, Cupertino, CA) to allow a continuous dose of 15 mg/kg/day (+)-naltrexone or vehicle. Rats experienced a 30-min cue-seeking test session on withdrawal day (WD) 1 to assess initial cue-seeking and match group assignments (vehicle vs. (+)-naltrexone). Immediately after the test session, rats were implanted with the subcutaneous mini-pump. Rats were tested for the incubation of cue seeking in a 3-h session conducted on WD 14. During the cue-seeking tests, responding on the drug-paired lever resulted in the cue light illumination, but no cocaine delivery.

Experiment 5: Effect of TLR4 antagonism on cocaine-primed reinstatement of cocaine seeking following extinction training

The effect of TLR antagonism was tested on cocaine-primed reinstatement of extinguished cocaine seeking. Rats acquired cocaine self-administration on an FR1 schedule for ten 2-h daily sessions similar to Exp. 1 followed by seven daily 2-h extinction sessions. A pretreatment of (+)-naltrexone (0, 15, 30 mg/kg, ip) was administered as two half-dose injections (0, 7.5, 15 mg/kg, ip, respectively) administered 30 min apart before the 2-h drug-primed reinstatement of cocaine seeking session.

Experiment 6: Effect of nucleus accumbens TLR4 antagonism on cocaine-primed reinstatement of cocaine seeking following extinction training

The effect of TLR antagonism within the core and shell subregions of the nucleus accumbens was tested on cocaine-primed reinstatement of extinguished cocaine seeking. Following cocaine self-administration and extinction as in prior experiments, drug-primed reinstatement of cocaine seeking session was evaluated following a pretreatment of LPS-Rs (5 μg/side) administered into the NAc Shell or NAc Core. All rats were tested under control conditions (Vehicle/Saline).

Statistics

Statistical analyses were performed using the GraphPad Prism (v9.4 for Mac, San Diego, CA). A 2-factor mixed-design analysis of variance (ANOVA) with sessions and treatment as the factors was used to evaluate the impact of (+)-naltrexone administration on cocaine self-administration studies using the dependent measures of cocaine infusions and lever responding. Responding during extinction training was analyzed in a 3-factor mixed-design ANOVAs with sessions and lever as within-subjects factors and treatment as a between-subject factor. Drug-paired and inactive lever responding during the seeking tests were analyzed separately in ANOVAs with the session and treatment as the factors. Self-administration and extinction data include more than 3 repeated measures and may lack sphericity across the repeated sampling. The Geisser-Greenhouse correction was used to adjust the degrees of freedom in determining the critical F-value. Significant main and interactive effects were followed by simple main effects tests and post hoc tests (Tukey’s multiple comparison test). Statistical significance was determined as p < 0.05.

Results

Experiment 1: Effects of TLR4 antagonism on the acquisition of cocaine self-administration and subsequent extinction and cocaine seeking

This experiment assessed the direct effects of subcutaneous delivery of (+)-naltrexone on the acquisition and maintenance of cocaine self-administration and the latent effects of continuous (+)-naltrexone delivery during cocaine self-administration on cocaine seeking during extinction training and cue-induced and drug-primed reinstatement testing (Fig. 1a). No differences were observed in the acquisition of cocaine self-administration between the (+)-naltrexone (n = 10) and vehicle-treated (n = 7) groups (Fig. 1b, treatment effect: F1,15 <1, NS, session: F6.12,91.73 = 5.60, p <0.001, treatment × session interaction: F16,240 <1, NS). No effect of previous treatment with (+)-naltrexone on extinction responding was found, although rats in both groups pressed the active lever more frequently than the inactive lever during the initial extinction sessions (Fig. 1c, treatment: F 1,15 = 1.08, p = .32, lever: F.25, 3.7 = 36.35, p < .01, treatment × session × lever interaction: F6,90 = 2.15, p = .06). Rats in both groups displayed cue-induced cocaine seeking as indicated by greater active lever responding during the reinstatement compared with the extinction sessions (session: F1,15 = 15.41, p < .001, treatment × session: F1,15 < 1, NS). No differences were found between the rats treated with (+)-naltrexone and vehicle during self-administration on cue-induced reinstatement test (Fig. 1d, treatment: F1,15 < 1, NS). Rats in both groups also showed robust drug-primed reinstatement of cocaine seeking with greater active lever responding during reinstatement compared with extinction (session: F1,15 = 17.77, p < .001, treatment × session: F1,15 < 1, p = .35), but no group differences were found during the drug-primed reinstatement session (Fig. 1e, treatment: F1,15 < 1, NS).

Fig. 1
figure 1

Effects of continuous TLR4 antagonism via osmotic minipump on the acquisition of cocaine self-administration and subsequent extinction and cocaine seeking. a Timeline illustrating the experimental design. b Subcutaneous mini-pump delivery of (+)-Naltrexone for ~14 days (shaded region) did not alter the acquisition or maintenance of cocaine self-administration as indicated by the mean ± SEM number of cocaine infusions. c (+)-Naltrexone administration during the self-administration phase had no impact on subsequent extinction of previously drug-paired or inactive lever responding (mean ± SEM). Prior (+)-naltrexone administration during the self-administration phase had no impact on subsequent cocaine-seeking induced by cocaine-associated cues (d) or a cocaine priming injection (e). Data are the mean ± SEM number of drug-paired lever responses. Vehicle n = 7; (+)-naltrexone n = 10

Experiment 2: Effects of TLR4 antagonism on cocaine’s reinforcing properties

This experiment assessed PR operant responding for cocaine in rats treated with continuous (+)-naltrexone (15 mg/kg/day) via osmotic mini-pumps (Fig. 2a). After acquiring the FR1 response, rats were advanced to an FR5 schedule of reinforcement (Fig. 2b). One day after minipump implantation, rats self-administered cocaine on a PR schedule for 4 consecutive days. (+)-Naltrexone delivered through an osmotic mini-pump had no effect on cocaine infusions during PR responding (Fig. 2c, treatment: F1,13 <1, NS, session: F2.21, 28.72 = 2.92, p = .07), treatment × session: F4,52 <1, NS) or cumulative lever responses (Fig. 2d, treatment: F1,13 <1, NS, session: F1.69, 21.96 = 1.902, p = .18), treatment × session: F4,52 <1, NS). These findings indicate that continuous (+)-naltrexone does not alter the motivation to acquire cocaine in a PR testing procedure.

Fig. 2
figure 2

Effects of continuous TLR4 antagonism on the motivation to acquire cocaine using a progressive ratio schedule of reinforcement. a Timeline illustrating the experimental design. b Cocaine intake (mean ± SEM number of cocaine infusions) on an FR5 schedule was equivalent before mini-pump implantation for continuous delivery of (+)-naltrexone during progressive ratio testing. Continuous delivery of (+)-naltrexone (n = 7; shaded region) did not alter the number of cocaine infusions (c) or the number of cumulative active lever responses (d) during progressive ratio testing compared with the vehicle group (n = 8)

The effects of acute injections of (+)-naltrexone administered immediately before the PR session were also used to evaluate 2 additional (+)-naltrexone doses in a separate cohort of rats (Fig. 3a). After acquiring the FR1 response and being advanced to an FR5 schedule of reinforcement (Fig. 3b), rats then self-administered cocaine on a PR schedule. During the PR sessions, rats received counterbalanced acute injections of either vehicle or (+)-naltrexone (30 mg/kg or 60 mg/kg, ip). No effect of treatment dose was observed on PR infusions (Fig. 3c, treatment: F2,23 = 2.80, p = .081) or cumulative lever presses (Fig. 3d, treatment: F2,23 = 1.77, p = .19) suggesting that (+)-naltrexone did not alter the motivation to acquire cocaine using a PR testing procedure.

Fig. 3
figure 3

Effects of acute TLR4 antagonism on the motivation to acquire cocaine using a progressive ratio schedule of reinforcement. a Timeline illustrating the experimental design. b Cocaine intake (mean ± SEM number of cocaine infusions) on an FR5 schedule was stable before evaluating the effects of vehicle (n = 14), 30 mg/kg (+)-naltrexone (n = 8), and 60 mg/kg (+)-naltrexone (n = 6) on progressive ratio performance using a modified within-subject design. Acute administration of (+)-naltrexone failed to alter the number of cocaine infusions (c) or the cumulative active lever responses (d) during progressive ratio testing compared with the vehicle group

Experiment 3: Acute (+)-naltrexone before sucrose self-administration did not alter responding for sucrose

Prior work has demonstrated that (+)-naltrexone reduces operant responding for a food reward (Tanda et al. 2016). Here, we tested the effects of (+)-naltrexone on operant responding for sucrose using an administration procedure designed to maximize the bioavailability of (+)-naltrexone (Fig. 4a). Food-restricted rats were trained to self-administer sucrose pellets on an FR1 schedule of reinforcement for 4 daily 2-h sessions followed by an additional 4 sessions on an FR5 schedule of reinforcement. Rats (n = 6) then underwent testing over 4 days during which they received pretreatment of (+)-naltrexone (0, 15, 30, or 60 mg/kg, ip). Surprisingly, rats continued to respond robustly at the active lever for sucrose pellets regardless of the (+)-naltrexone pretreatment (Fig. 4, treatment: F3, 30 <1, NS, lever (F1,10 = 78.10, p <.0001), treatment × lever (F3,30 <1; NS). These findings indicate that (+)-naltrexone does not have a major impact on motivation to respond for natural rewards.

Fig. 4
figure 4

Effects of TLR4 antagonism on the self-administration of sucrose pellets. a Timeline illustrating the experimental design. b Sucrose responding during the FR5 sessions was stable and robust before evaluating the effects of vehicle or naltrexone (15, 30, and 60 mg/kg, ip) using a within-subject design (n = 6) on FR5 responding for sucrose pellets. c Acute administration of (+)-naltrexone failed to alter the number of lever responses (left) or the total sucrose consumed (right) compared with the vehicle treatment. Data are presented as the mean ± SEM number of active and inactive lever responses

Experiment 4: Effect of TLR4 antagonism on cocaine seeking during abstinence

This experiment assessed the effects of chronic subcutaneous osmotic mini-pump delivery of (+)-naltrexone during abstinence on cue-reinforced cocaine seeking (Fig. 5a). Rats self-administered cocaine during 15 consecutive 2-h daily sessions (Fig. 5b) and were tested during a 30-m cue-reinforced drug-seeking session on WD1. Rats were then implanted with subcutaneous osmotic mini-pumps delivering (+)-naltrexone (15 mg/kg/day) or vehicle and remained in the home cage. On WD14, rats were tested on a 3-h cue-reinforced drug-seeking session. A main effect of abstinence day was found indicating that cue-reinforced lever pressing during the first 30 min increased from WD1 to WD14 (session: F1,13 = 14.00, p < .01). However, no effect of (+)-naltrexone treatment was observed (Fig. 5c, treatment: F1,13 <1 , NS), treatment × session: F1,13 <1, NS). No effect of treatment was found on total lever presses during 1-h bins (Fig. 5d, treatment: F1,13 = 1.43, p = .25, time: F1.87, 24.28 = 24.13, p <.0001, treatment × Time: F2,26 <1, NS) or over the entirety of the 3-h session (Fig. 5e, treatment: t13 = 1.20, p = .25). These findings suggest the (+)-naltrexone administered during abstinence is not sufficient to reduce subsequent cocaine seeking similar to findings observed with another psychostimulant drug, methamphetamine (Theberge et al. 2013).

Fig. 5
figure 5

Effects of TLR4 antagonism during the abstinence phase on incubated cue-induced cocaine seeking. a Timeline illustrating the experimental design. b Cocaine intake (mean ± SEM number of cocaine infusions) was stable before the implantation of the osmotic mini-pump. c Cue seeking during the 30-m test on day 1 and the first 30-m of the 3-h test on day 14 indicate that cocaine-seeking (mean ± SEM number of drug-paired lever responses) increased to a similar magnitude in both the vehicle and (+)-naltrexone treated rats (*p < 0.05 compared with day 1). d Evaluation of hourly cocaine cue seeking (mean ± SEM number of drug-paired lever responses) on abstinence day 14 also indicated a similar distribution of seeking between the treatment groups. e Total responding (mean ± SEM number of drug-paired lever responses) during the 3-h cue-seeking test was not significantly different between the vehicle and (+)-naltrexone treated groups. Vehicle n = 7; (+)-naltrexone n = 8

Experiment 5: Effect of TLR4 antagonism on cocaine-primed reinstatement of cocaine seeking following extinction training

Inhibition of TLR4 receptors in the ventral tegmental area was previously shown to be sufficient to inhibit drug-primed reinstatement of cocaine seeking (Brown et al. 2018). Here, we assessed the effects of multiple (+)-naltrexone doses on drug-primed reinstatement of cocaine seeking (Fig. 6a). Rats were trained to self-administer cocaine for 15 consecutive 2-h daily sessions (Fig. 6b), and lever responding was extinguished in seven 2-h extinction sessions (Fig. 6c). Rats were then administered vehicle or (+)-naltrexone (15 or 30 mg/kg, ip) before a drug-primed reinstatement session or vehicle control reinstatement session. Pretreatment of 30-mg/kg (+)-naltrexone significantly reduced active lever presses during the drug-primed reinstatement session (Fig. 6d, cocaine-prime: F1,51 = 37.81, p < .0001, treatment: F2,51 = 3.66, p = .033, interaction: F2,51 = 3.71, p = .031). Post hoc Tukey’s comparison revealed that the 30-mg/kg (+)-naltrexone group was significantly reduced compared with the vehicle-treated group during drug-primed reinstatement session (t51 = 6.67, p < .001), although rats treated with 30 mg/kg (+)-naltrexone still displayed significant cocaine seeking compared with the vehicle group.

Fig. 6
figure 6

Effects of TLR4 antagonism on drug-primed reinstatement of cocaine seeking. a Timeline illustrating the experimental design. b Cocaine intake (mean ± SEM number of cocaine infusions) was stable before extinction training and reinstatement testing. c Rats were extinguished in seven daily 2-h extinction sessions. Data indicate the mean ± SEM number of drug-paired and inactive lever responses. d Rats were administered a priming injection of either cocaine (15 mg/kg, ip) or vehicle to elicit cocaine seeking. Before the priming injection, rats were administered either vehicle or (+)-naltrexone (15 or 30 mg/kg, ip). All pretreatment groups displayed significant drug-paired lever responding following the cocaine priming injection compared with a vehicle priming injection (*p < 0.05 compared with respective vehicle prime). Administration of 30 mg/kg (+)-naltrexone decreased cocaine-primed seeking compared with vehicle pretreatment (#p < 0.05). Veh/Veh n = 7; 15 mg/kg (+)-naltrexone/Veh n = 5; 30 mg/kg (+)-naltrexone/Veh n = 8; Veh/Cocaine n = 13; 15 mg/kg (+)-naltrexone/cocaine n = 6; 30 mg/kg (+)-naltrexone/cocaine n = 18

Experiment 6: Effect of nucleus accumbens TLR4 antagonism on cocaine-primed reinstatement of cocaine seeking following extinction training

Prior work demonstrates that the ventral tegmental area is a brain region where TLR4 is implicated in drug-primed cocaine seeking (Brown et al. 2018). We sought to extend those findings by assessing the effect of the TLR4 antagonist LPS-Rs delivered into the NAc Shell (n = 16) or NAc Core (n = 8) on drug-primed reinstatement (Fig. 7a). Rats self-administered cocaine in daily 2-h sessions (Fig. 7b), and lever responding was extinguished in 7 daily 2-h extinction sessions (Fig. 7c). Delivery of LPS-Rs (5 μg/side) into the NAc Shell reduced drug-primed reinstatement of cocaine seeking compared to the vehicle-treated group (NAc Shell and NAc Core pooled to form single vehicle group, n = 16) and LPS-Rs NAc Core group (Fig. 7d, treatment: F3,60 = 6.28, p < .001, lever: F1,60 = 38.53, p <.0001, treatment × lever: F3,60 = 8.71, p < .0001). The post hoc comparison indicates that the vehicle-treated group (t120 = 8.37, p <.0001) and LPS-Rs NAc Core treated group (t120 = 6.40, p < .0001) were both significantly different than the vehicle saline-prime control group (n = 24). These findings suggest that TLR4 in the NAc Shell, but not the NAc Core, plays a role in cocaine seeking induced by a cocaine priming injection.

Fig. 7
figure 7

Effects of TLR4 antagonism in the nucleus accumbens on drug-primed reinstatement of cocaine seeking. a Timeline illustrating the experimental design. b Cocaine intake (mean ± SEM number of cocaine infusions) was stable before extinction training and reinstatement testing. c Rats were extinguished in seven daily 2-h extinction sessions. Data indicate the mean ± SEM number of drug-paired and inactive lever responses. d Rats were administered a priming injection of either cocaine (15 mg/kg, ip) or vehicle to elicit cocaine seeking. Before the priming injection, rats were administered either LPS-Rs (5 μg/side) or vehicle in the NAc Shell or NAc Core. A cocaine priming injection induced significant drug-paired lever responding in rats receiving an intra-cranial vehicle injection or intra-NAc Core administration of LPS- Rs (*p < 0.05 compared with vehicle prime). Administration of LPS-Rs into the NAc Shell, but not the NAc Core, resulted in diminished cocaine-primed seeking (#p < 0.05 compared with vehicle pretreatment). e Histological plates illustrating the placements of the intra-NAc Shell and intra-NAc Core microinfusions. Rats with misplaced infusions (shell n = 2; core n = 2) were excluded from all analyses. Veh (core and shell)/Veh n = 24; Veh (core and shell)/cocaine n = 16; core LPS-Rs/cocaine n = 8; shell LPS-Rs/cocaine n = 16

Discussion

This study sought to further investigate the role of TLR4 in cocaine self-administration and cocaine-seeking using a pharmacological antagonist approach. Acute administration of the TLR4 antagonist (+)-naltrexone reduced drug-primed reinstatement of cocaine seeking. Bilateral intracranial administration of LPS-Rs into the NAc Shell also reduced drug-primed reinstatement suggesting the NAc Shell as a site where cocaine may interact with TLR4 to induce cocaine seeking. These results are consistent with the previous reports from our laboratory suggesting a role for the TLR4 system in cocaine-elicited drug seeking (Northcutt et al. 2015; Brown et al. 2018). The impact of TLR4 inhibition appears specific to drug-primed reinstatement since (+)-naltrexone administration during forced abstinence did not alter cue-reinforced cocaine seeking. This finding is similar to that observed on methamphetamine seeking when (+)-naltrexone was administered during abstinence (Theberge et al. 2013).

The TLR4 system has previously been investigated in rodent self-administration models. Results have generally indicated a role for TLR4 in drug reinforcement and seeking behaviors, but findings have been drug-specific and, in some studies, complicated by the generalization of TLR4 antagonist effects to non-drug reinforcers (e.g., sucrose reward). In psychostimulant drug reinforcement and reward-related models, (+) naloxone and (+)-naltrexone blocked methamphetamine and cocaine conditioned place preference and reduced cocaine operant self-administration (Northcutt et al. 2015; Wang et al. 2019). Similarly, a genetic approach using the C3H/HeJ mouse strain that possesses a spontaneous point mutation that impedes TLR4/NF-κB signaling, illustrates that C3H/HeJ TLR4-mutant mice self-administered less cocaine on both fixed- and progressive-ratio schedules of reinforcement (Poltorak et al. 1998; Northcutt et al. 2015). Both pharmacological and genetic approaches did not detect an effect of TLR4 manipulation during sucrose self-administration, suggesting specificity to cocaine-related behaviors (Northcutt et al. 2015). Interpretation of reduction in cocaine self-administration of C3H/HeJ mice warrants caution, however, as these mice have a homozygous 20% inversion of chromosome 6, limiting conclusions about the specificity of observed effects to TLR4. A more recent study found that TLR4 (−/−) mice exhibit reduced cocaine locomotor sensitization and conditioned place preference thresholds with no differences in novel object exploration, open-field locomotor activity, or sucrose preference between TLR4 knockout and wild-type mice (Kashima and Grueter 2017). Only one prior study has examined the effect of TLR4 blockade on the reinstatement of cocaine-seeking, a model of relapse-like behavior. In this study, intra-cranial administration of the TLR4 antagonist LPS-Rs into the VTA reduced drug-primed reinstatement of cocaine seeking, but the identical treatment did not alter reinstatement of sucrose seeking elicited by non-contingent delivery of sucrose pellets (Brown et al. 2018). In contrast to TLR4 blockade, activation of TLR4 appears to exert opposite behavioral effects. For instance, Tortorelli et al. (2015) found that activation of TLR4 with LPS resulted in cocaine locomotor sensitization to a sub-threshold dose of cocaine. Additionally, TLR4 activation with LPS in the VTA was sufficient to reinstate cocaine-seeking (Brown et al. 2018). Translational evidence linking early life stress and TLR4 neuro-immune signaling to the subsequent propensity for cocaine abuse indicates a dynamic interplay among neuro-immune function, development, and cocaine abuse (Lo Iacono et al. 2018). Together, these studies suggest a bi-directional relationship between TLR4 blockade, which generally reduces cocaine behaviors, and TLR4 activation, which appears to heighten cocaine sensitivity and initiate drug-seeking behaviors.

Our results support a role for TLR4 in the reinstatement of cocaine seeking, but do not support a role of TLR4 in cocaine reinforcement as reported previously (Northcutt et al. 2015; Tanda et al. 2016). Unlike the prior studies, the current studies implanted osmotic minipump to chronically deliver (+)-naltrexone. A similar discrepancy was observed with different types of TLR4 antagonist administration on opioid self-administration. Acute delivery of (+)-naloxone or (+)-naltrexone inhibited self-administration of remifentanil while chronic delivery of (+)-naltrexone via an osmotic minipump failed to alter heroin self-administration (Hutchinson et al. 2012; Theberge et al. 2013; Tanda et al. 2016; Yue et al. 2019). Chronic drug delivery via an osmotic minipump has several advantages over repeated injections including reduced animal stress, consistent drug levels, increased drug efficacy, and generally fewer side effects. However, constant drug delivery over about a 2-week period may produce adaptations within the TLR4 system and its downstream targets that render (+)-naltrexone less efficacious. Alternatively, other physiological adaptations such as metabolic tolerance may reduce the overall bioavailability of (+)-naltrexone when administered via an osmotic minipump. Many differences in pharmacokinetics and pharmacodynamics between these drug delivery routes likely play a role in these discrepant findings, limiting interpretation of the precise mechanisms that account for our observed differences in cocaine reinforcement. Future studies may benefit from assessing distinct molecular and synaptic consequences of TLR4 antagonists administered through different routes and time periods.

Concerns regarding the specificity of the effects of TLR4 pharmacological antagonists emerged following replication attempts in opioid and cocaine reinforcement models. Tanda et al. (2016) found that (+)-naltrexone reduced cocaine self-administration. However, reductions in sucrose self-administration were also observed, suggesting non-specific effects (Tanda et al. 2016). Conflicting findings regarding the effects of these pharmacological compounds on non-drug reward behaviors, including null results in this report, may point to discrepancies across various models and dosing regimens used by different laboratories. One notable difference between the present dosing regimen and those of Tanda et al. (2016) and Yue et al. (2019) is the timeframe during which (+)-naltrexone was delivered. In the current studies, (+)-naltrexone was administered via two doses 30 min apart or osmotic minipump to achieve a stable steady-state concentrations within the brain. This dosing regimen was selected because of previous findings that determined a short half-life (~1 h) for (+)-naltrexone (Northcutt et al. 2015). This strategy was also used in previous reports from Northcutt et al. (2015) and Zhang et al. (2018) with similar doses and efficacy. Other strategies to manipulate TLR4, including transgenic rodent and pharmacological methods, have found results specific to drug reward compared to natural reward, memory, and locomotor behaviors, all of which argue against the interpretation that TLR4 effects are generalized across behavioral outcomes.

The role of neuro-immune function in drug abuse has received greater attention recently, and investigations are not limited to TLR4. Widespread transcriptional and translational changes of neuroimmune-related genes have been found following cocaine self-administration in rodents and non-human primates (Vallender et al. 2017; Burkovetskaya et al. 2020). The breadth of cocaine-induced neuroimmune changes indicates the need to further investigate their involvement in drug abuse-related behaviors. Given widespread changes in neuroimmune gene expression following cocaine experience, it is not surprising that, to date, several pleiotropic immune signaling molecules have been found that alter psychostimulant behaviors and mesolimbic DA function (Calipari et al. 2018; Hofford et al. 2019; Gao et al. 2020). Similarly, other receptors, including subtypes of the TLR family and the endoplasmic reticulum stress pathway, appear to be involved in cocaine-induced inflammation (Guo et al. 2015; Liao et al. 2016). Finally, neuroimmune interactions are not limited to psychostimulants, as indicated by studies in alcohol, opioid, and nicotine drug models (Hutchinson et al. 2012; Theberge et al. 2013; Northcutt et al. 2015; Crews et al. 2017; Linker et al. 2020). Together, this work highlights the potential importance of neuro-immune interactions in cocaine abuse and the complexity of using immune-related treatment approaches.