Introduction

Drug addiction is defined as a chronic disorder characterized by relapse accompanied by the compulsion to seek and take the drug and the loss of control in limiting intake (Koob and Le Moal 1997). When access to the drug is prevented, a negative emotional state emerges, reflecting a motivational withdrawal syndrome (Koob and Volkow 2010). Due to this negative emotional state, drugs of abuse become negative reinforcement during withdrawal (Koob and Le Moal 2001), leading to relapse due to the impelling need to consume (Koob and Volkow 2010). Therefore, it is not surprising that patients who undergo an addiction treatment tend to seek alternative reinforcements, including natural rewards, such as palatable food (high-fat and/or sugar-rich food), in order to stimulate brain circuits of reward (Salamone et al. 2005). In fact, clinical evidence emphasizes the frequent use of palatable food to decrease drug craving during withdrawal (Cowan and Devine 2008).

Preclinical studies have also pointed to this relation. Orsini et al. (2014) reported that rats with a history of chronic amphetamine exposure increased their food consumption. In addition, Loebens and Barros (2003) observed that animals fed with a high-fat diet (HFD) are more prone to depression during cocaine withdrawal in the forced swimming test (Loebens and Barros 2003). Moreover, other studies suggest that fat intake may represent a competitive reward for drugs. The conditioned place preference (CPP) procedure evaluates the role of environmental cues associated with the rewarding effects of drugs of abuse, such as cocaine, which were decreased by previous exposure to HFD (Morales et al. 2012). In this line, the present work is a follow-up of our previous study reporting that continuous HFD administration during the extinction of cocaine-induced CPP reduced the sessions required to extinguish the preference and decreased the sensitivity to drug priming-induced reinstatement (Blanco-Gandía et al. 2017a).

Epidemiological studies have shown high levels of comorbidity between eating disorders (bulimia and binge eating disorder) and substance abuse (Becker and Grilo 2016; Conason et al. 2006; Holderness et al. 1994; Nøkleby 2012; Flores-Fresco et al. 2018). We speculate that this high comorbidity could in part be due to shared reinforcing properties between palatable foods and drugs of abuse. For example, certain foods, particularly those rich in sugar and fat, are potent rewards that promote eating even in the absence of energetic requirements (Lenoir et al. 2007), being able to trigger learned associations between environmental stimulus and reward (Volkow et al. 2011). In fact, several studies have pointed out that a continuous access to fat diminishes the rewarding effects of cocaine (Morales et al. 2012; Thanos et al. 2010; Blanco-Gandía et al. 2017a). Palatable food activates the reward system (DiLeone et al. 2012; Narayanaswami et al. 2013) through the activation of the mu-opioid receptor pathway in the VTA (Pitman and Borgland 2015) and the cannabinoid system (Parylak et al. 2012).

The present work relates to the idea of palatable food as an alternative reward. In the previous studies, a continuous high-fat diet produced harmful metabolic effects (Blanco-Gandía et al. 2017b). Here, we aim to study if an intermittent and limited access to a HFD could also diminish cocaine-associated memories without causing changes on bodyweight or metabolism, proving that a sporadic exposure could be sufficient to extinguish the preference and block reinstatement. Our study will be the first to evaluate the possible counteracting effects of intermittent and limited access to a HFD on the extinction and reinstatement of cocaine-induced CPP. Our general hypothesis is that the limited access to a HFD will accelerate the extinction of cocaine-associated memories and will reduce cocaine priming reinstatement. To assess metabolic disturbances, we will measure bodyweight and leptin and ghrelin changes after HFD exposure. Because the opioid and cannabinoid systems play a crucial role in food and drug reward (de Macedo et al. 2016), we also evaluated the effects of HFD administrations on the opioid mu receptor (Oprμ) and CB1 receptor gene expression (CB1r) in the striatum. Oprμ and CB1r are implicated in food reward processes and palatability (Kessler et al. 2016; Bello et al. 2014), as well as in various forms of learning and memory, including acquisition and reinstatement of cocaine-associated memory (Sticht et al. 2010; Hu et al. 2015). Given that sex differences in the vulnerability to drug seeking have been scarcely studied (Carroll et al. 2002), we will conduct our study in both male and female mice. For example, several data suggest a more intense response to cocaine in female rodents, with more psychomotor sensitization (Holly et al. 2012), faster acquisition of cocaine self-administration (Martini et al. 2014), and higher a breaking point of the progressive ratio and cocaine reinstatement (Lynch 2006). Sex-specific differences have also been described in response to HFD, with male rodents being more susceptible to physiological changes (Grove et al. 2010; Mela et al. 2012; Wang et al. 2018; Gelineau 2017).

Material and methods

Subjects

A total of 60 female and 47 male mice of the OF1 outbred strain were acquired commercially from Charles River (Barcelona, Spain). Animals were 42 days old when they arrived at the laboratory and were all housed under standard conditions in groups of 4–5 (cage size 28 × 28 × 14.5 cm). Mice were exposed to a reverse light cycle (white lights on from 19:30 to 7:30), and the vivarium was controlled for constant temperature (21 ± 2 °C). Food (standard diet) and water were available ab libitum except during the behavioral tests. All procedures involving mice and their care complied with national, regional, and local laws and regulations, which are in accordance with Directive 2010/63/EU of the European Parliament and the council of September 22, 2010, on the protection of animals used for scientific purposes. The Animal Use and Care Committee of the University of Valencia approved the present study.

Drug treatment

For CPP, animals were injected intraperitoneally (IP) with 10 mg/kg of cocaine hydrochloride (Laboratorios Alcaliber S.A., Madrid, Spain) diluted in 0.9% NaCl (saline) in a volume of 0.1 mL/10 g bodyweight.

Experimental design

Animals first underwent the 10 mg/kg cocaine-induced CPP procedure from postnatal day (PND) 43. From PND 53 onwards, all mice were, from this moment on, exposed twice a week to extinction sessions (Table 1). Female and male mice were randomly divided into four groups: standard diet (SD); daily high-fat diet (HFD-24h); Monday, Wednesday, and Friday high-fat diet (HFD-MWF); and high-fat diet 1h before extinction (HFD-Ext).

Table 1 Experimental design. PND postnatal days, HFD high-fat diet

Apparatus and procedure

Conditioned place preference

For place conditioning, we employed sixteen identical Plexiglas boxes with two equally sized compartments (30.7 cm length × 31.5 cm width × 34.5 cm height) separated by a gray central area (13.8 cm length × 31.5 cm width × 34.5 cm height). The compartments have different colored walls (black vs white) and distinct floor textures (fine grid in the black compartment and wide grid in the white one). Four infrared light beams in each compartment of the box and six in the central area allowed the recording of the position of the animal and its crossing from one compartment to the other. The equipment was controlled by two IBM PC computers using MONPRE 2Z software (CIBERTEC S.A., Spain).

Acquisition of CPP

The procedure of place conditioning, unbiased in terms of initial spontaneous preference, was performed as described previously (Maldonado et al. 2006) and consisted in three phases. To summarize the main aspects, in the first phase, known as Pre-C, mice were allowed access to both compartments of the apparatus for 15 min (900 s) per day for 3 days. On day 3, the time spent in each compartment over a 900-s period was recorded, and animals showing a strong unconditioned aversion (less than 33% of the session time) or preference (more than 67%) for any compartment were excluded for the rest of the experiment (total excluded: 4). Half of the animals in each group received the drug or vehicle in one compartment and the other half in the other compartment. After assigning the compartments, no significant differences were detected between the time spent in the drug-paired and vehicle-paired compartments during the pre-conditioning phase. In the second phase (conditioning), which lasted 4 days, animals received an injection of physiological saline immediately before being confined to the vehicle-paired compartment for 30 min. After an interval of 4 h, they received an injection of cocaine immediately before being confined to the drug-paired compartment for 30 min. Confinement was carried out in both cases by closing the guillotine door that separated the two compartments, making the central area inaccessible. During the third phase, known as Post-C, the guillotine door separating the two compartments was removed (day 8), and the time spent by the untreated mice in each compartment during a 900-s observation period was recorded. The difference in seconds between the time spent in the drug-paired compartment during the Post-C test and the Pre-C phase is a measure of the degree of conditioning induced by the drug. If this difference is positive, then the drug has induced a preference for the drug-paired compartment, while the opposite indicates that an aversion has been developed.

Extinction of CPP

When the preference for the drug-paired compartment was established, mice underwent twice a week (Monday and Thursday) an extinction session that consisted of placing the animals in the apparatus (without the guillotine doors separating the compartments) for 15 min. The extinction condition was fulfilled when there was a significant difference between CPP scores and Post-C scores in two consecutive sessions and a lack of significant difference between CPP and Pre-C test values.

Reinstatement of CPP

Twenty-four hours after extinction had been confirmed, the effects of a priming dose of cocaine were evaluated. Reinstatement tests were the same as those carried out in Post-C (free ambulation for 15 min), except that animals were tested 15 min after administration of the respective dose of cocaine. When reinstatement of the preference was achieved, after a subsequent weekly extinction process, a new reinstatement test was conducted with progressively lower doses of the drug until the CPP was completely extinguished. This procedure of extinction-reinstatement was repeated with decreasing doses (half the previous dose) until a priming dose was confirmed to be ineffective (5 mg/kg after extinction of CPP induced by 10 mg/kg and 2.5 mg/kg after extinction if animals showed reinstatement with 5 mg/kg). HFD conditions were maintained until the end of the experiment. Priming injections were administered in the vivarium, which constituted a non-contingent place to that of the previous conditioning procedure.

Feeding conditions

After the Post-C test in the CPP, each group began their intermittent access to the different HFD patterns. Our feeding procedure is based on the limited access model described by Corwin et al. (1998), in which non-food-deprived animals have sporadic and limited access to a HFD. Two different types of diet were used in this study: a standard diet (Teklad Global Diet 2014, 13 Kcal % fat, 67 Kcal % carbohydrates, and 20 Kcal % protein; 2.9 kcal/g) and the HFD (TD.06415, 45 Kcal % fat, 36 Kcal % carbohydrates and 19% Kcal protein; 4.6 kcal/g). Both diets were supplied by Harlan Laboratories Models, S. L. (Barcelona, Spain) and will be referred to from now on as the standard diet and the high-fat diet (HFD).

As we described on Table 1, on postnatal day (PND) 53, mice were randomly divided into 4 groups of males and 4 groups of females with similar average bodyweights and assigned to one of the following conditions: standard diet (SD); daily high-fat diet (HFD-24h); Monday, Wednesday, and Friday high-fat diet (HFD-MWF); and high-fat diet before extinction (HFD-Ext). All the groups had standard diet access during the whole procedure, and the HFD-24h group had a 1-h access to HFD from Monday to Friday; the HFD-MWF had a 2-h access to HFD on Monday, Wednesday, and Friday; and lastly, animals in the HFD-Ext had a 1-h HFD access prior to each extinction session (twice a week). The SD group remained undisturbed in their home cage.

All groups were fed ad libitum with the standard diet (Teklad Global Diet 2014) in their own cages, and water was freely available. After acquiring cocaine preference, during the extinction process, animals continued with SD in their home cages, but only the animals in the HFD groups were taken for a limited time into a separated plastic cage with access to the HFD. Exposure to the HFD took place in different individual plastic cages 2–3 h after the beginning of the dark phase. HFD intake was measured after each session. Animals were weighed every week throughout the study, at which point their intake of the standard diet in their home cage was also measured in grams.

Determination of plasma leptin and ghrelin levels

For leptin and ghrelin plasma quantification, an ELISA kit was employed for leptin and ghrelin (Merck-Sigma Aldrich, Saint Louis, USA) following the manufacturer’s instructions. The sensitivity of the test is 0.2. All samples were run in duplicate.

Gene expression analyses: RNA isolation and quantitative RT-PCR

At the end of the experiments, animals were euthanized by cervical dislocation, and the brains were immediately removed from the skull and placed on a cold plaque. Cerebellum and olfactory bulbs were eliminated, and the striatum was dissected. Brain tissue samples were immediately stored at −80°C until the RT-PCR assay was performed. The whole striatum was employed for this analysis.

Total RNA from the striatum was isolated using the Tri Reagent Method (Sigma-Aldrich, St. Louis, MO, USA), as described in the manufacturer’s protocol. Reverse transcription of 1 mg of total RNA was performed using the Transcriptor First Strand cDNA synthesis kit (Thermo Fisher Scientific, Madrid, Spain). Amplification of the target and housekeeping (b-glucuronidase) genes was performed using the Taqman Gene Expression Master Mix (Thermo Fisher Scientific, Madrid, Spain) in a LightCycler 480 System (Roche Diagnostics) following the manufacturer’s instructions. The assay codes of the primers used are Mm01212171_s1, Mm01188089_m1, and Mm00446953_m1 corresponding to CB1r, Oprμ, and Gusb, respectively. Data were analyzed using the LightCycler 480 relative quantification software and were normalized to the amplification product of b-glucuronidase.

Statistics

To test for the acquisition of CPP, the time spent in the drug-paired compartment was analyzed with a (2)×2×4 mixed ANOVA. The two between factors were sex (male and female) and diet (SD, HFD-24h, HFD-MWF, and HFD-Ext). The within subjects factors tested the measurement differences between the pre- and post-CPP phases (Pre-C and Post-C). Since at this point of measurement all the animals had been maintained with the standard diet, the diet factor simply functions as a control check for the effectiveness of the random assignment to produce equivalent groups prior to exposure to the extinction-reinstatement manipulation (i.e., the expectation is that there will not be an effect for Diet). To compare whether extinction/reinstatement is achieved within the same group, data related to extinction, 2.5 mg/kg reinstatement, and 5 mg/kg reinstatement were analyzed by means of Student’s t-test. The time required for the preference to be extinguished was analyzed by means of the Kaplan-Meier test, with Breslow (generalized Wilcoxon) comparisons when appropriate. The extinction analyses were performed within every group; therefore, no comparisons among groups were made. Ghrelin and leptin plasma levels and CB1r and Oprμ gene expression values were analyzed by a one-way ANOVA, considering the between variable –Sex–, with two levels (male and female), and the between variable –Diet–, with 4 levels (SD, HFD-24h, HFD-MWF, HFD-Ext). All data are presented as mean ± standard error of mean (SEM). A p-value <0.05 was considered statistically significant. Analyses were performed using SPSS v26.

Results

Information relating to bodyweight and food intake can be found in the supplementary material S1. No bodyweight and food intake differences were found between the four diet groups within each sex group.

Conditioned place preference

Results of the 10 mg/kg cocaine-induced CPP are presented in Fig. 1a (males) and b (females). Regarding the main effects, the ANOVA for the time spent in the drug-paired compartment revealed an effect of the variable days [F(1,99)=138,410; p<0.001], which indicates that all animals developed conditioned place preference (Post-C vs Pre-C). Bonferroni’s post hoc comparisons showed that in all cases, males and females spent significantly more time in the drug-paired compartment in Post-C than in Pre-C (p<0.001 in all cases). No effect was found in the variables sex nor diet separately. Regarding the interactions, there was no significant effect in the three-way interaction days × sex × diet, but the ANOVA did reveal a significant effect of the interaction days × sex [F(1,99)=3833; p<0.05]. Bonferroni’s post hoc comparisons showed that female mice spent more time in the drug-paired compartment in Post-C than male mice (p<0.05), showing a stronger CPP but only in two groups: those that will receive the HFD tree days a week (HFD-MWF) or previously to each extinction sessions (HFD-Ext).

Fig. 1
figure 1

Effects of HFD during the extinction-reinstatement process in conditioned place preference (CPP) in males (a) and females (b). CPP was induced by 10 mg/kg of cocaine in male and female mice fed with a standard diet and exposed after Post-C to different dietary conditions: SD group (standard diet throughout the procedure), HFD-24h group (1-h access to HFD from Monday to Friday), in HFD-MWF group (2-h access to high-fat diet on Monday, Wednesday, and Friday), and in the HFD-Ext group (1-h high-fat diet access twice a week, prior to extinction sessions). Bars represent the time (± SEM) in seconds spent in the drug-paired compartment in the pre-conditioning test (white bars), the post-conditioning test (black bars), in the last extinction session (light gray bars), and in the reinstatement test (dark gray bars). The first reinstatement test was evaluated 15 min after a priming dose of 5 mg/kg of cocaine, while the second reinstatement test was evaluated 15 min after a priming dose of 2.5 mg/kg of cocaine. ***p<0.001 significant difference with respect to the Pre-C (ANOVA); +p<0.05 significant difference in female Post-C vs male Post-C. #p<0.05 significant difference with respect to extinction (Student’s t test)

To compare if there was a reinstatement into drug seeking after achieving extinction within the same group, Student’s t-tests were performed. Reinstatement with a priming dose of 5 mg/kg cocaine (Fig. 1a) was achieved in the male SD group (t=−2.772; d.f. 11; p=0.018), but after subsequent extinction sessions, the Student’s t-test showed that no reinstatement with a priming dose of 2.5 mg/kg cocaine was achieved (t=−0.078; d.f. 11; p=0.939). In females (Fig. 1b), the Student’s t-test showed reinstatement with a priming dose of 5 mg/kg cocaine in SD (t=−2.297; d.f. 14; p=0.038), HFD-24h (t=−2.529; d.f. 14; p=0.024), and HFD-Ext (t=−2.282; d.f. 14; p=0.039). Extinction was confirmed with the Student’s t-test, where no group of males and females showed any differences with respect to the Pre-C test (data not shown). After two subsequent extinction sessions, the Student’s t-test showed that no reinstatement with a priming dose of 2.5 mg/kg cocaine was achieved in any group (SD: t=−1.241; d.f. 14; p=0.235; HFD-24h: t=−0.684; d.f. 14; p=0.505; and HFD-Ext: t=−0.303; d.f. 14; p=0.767).

With regard to the time required to extinguish the preference, we found that in males (Fig. 2), the SD, HFD-MWF, and HFD-24h groups required a total of 16, 21, and 17 sessions, respectively, to achieve extinction, while HFD-Ext required only 8 sessions to extinguish the preference. The Kaplan-Meier analysis showed that the HFD-Ext male group required significantly less time to achieve extinction than the SD group (χ2=4.336; p<0.05), with no significant differences with respect to the other groups.

Fig. 2
figure 2

Mean of extinction sessions. The bars represent the mean value (± SEM) of the number of sessions required for the preference to be extinguished after the Post-C test. Preference was considered to be extinguished when an animal spent 370s or less in the drug-paired compartment on two consecutive days. When the preference was not extinguished in a mouse, the number of days needed to achieve extinction in the whole group was assigned to that animal. *p<0.05, **p<0.01 with respect to the respective standard diet group (SD); #p<0.05 with respect to HFD-MWF and HFD-Ext

In female groups, the SD, HFD-MWF, and HFD-Ext groups required 9 sessions to achieve extinction, while the HFD-24h group required a total of 15 sessions to achieve extinction (Fig. 2). The Kaplan-Meier analysis revealed that the HFD-24h group required significantly more time to achieve extinction than the SD group (χ2=8.431; p<0.01), HFD-MWF (χ2=4.250; p<0.05), and HFD-Ext (χ2=4.289; p<0.05).

Effects of different HFD eating patterns and sex on plasma leptin and ghrelin levels and CB1r and Oprμ gene expression

There were no significant effects of the different intermittent eating patterns or sex on circulating leptin [F(3,56)=0.975; p=0.411] and ghrelin levels [F(3,56)=0.440; p=0.725) (Table 2). Regarding the real-time PCR analyses, the ANOVA indicated in the CB1r gene expression (Fig. 3a) an effect of the interaction diet × sex [F(3,56)=3.340; p<0.05]. There was an increased CB1r gene expression only in males of the HFD-Ext group with significant differences with respect to the SD group (p<0.01), HFD-MWF group (p<0.01), and HFD-24h group (p<0.001). Moreover, the HFD-Ext male group showed a significant increase against their corresponding HFD-Ext female group (p<0.05).

Table 2 Plasma leptin (ng/ml) and ghrelin (pg/ml) levels. Data are presented as mean values ± SEM (n=8/condition)
Fig. 3
figure 3

Real-time PCR gene expression in the striatum. (a) CB1r relative gene expression evaluation in the striatum region (n = 8/condition). (b) Oprμ relative gene expression evaluation in the striatum region (n=8/condition). The columns represent means and the vertical lines ± SEM of relative (2-ΔΔCt method) gene expression in the striatum of OF1 mice. **p<0.01 significant differences with respect to the rest of the groups. +p<0.05; ++p<0.01 significant differences with respect to their corresponding female group

Regarding Oprμ gene expression (Fig. 3b), the ANOVA revealed a significant effect in the variable sex [F(1,55)=7,604; p<0.01], as overall, HFD male mice overexpressed Oprμ with respect to females (p<0.01).

Discussion

The present study noticed that the intermittent and limited access to a HFD may prevent reinstatement of the conditioned rewarding effects of cocaine in males, and specific administration schedules can facilitate or disrupt the extinction of the cocaine-associated memories in males and females. Related to this, this study highlights the important role of sex in these effects, since there were great differences between the results obtained in male and female mice. Once 10 mg/kg cocaine-induced CPP was acquired, all the mice were exposed to a different intermittent administration of HFD. The main results of the present work indicate that in males, exposure to the HFD in any of the three patterns employed blocked the reinstatement of the preference with a priming dose of 5 mg/kg of cocaine. However, in female mice, only one HFD administration (HFD-MWF) was effective in blocking the reinstatement of cocaine-induced preference. Extinction of the memories associated with cocaine was faster when access to the HFD was prior to each extinction session, but this phenomenon occurred only in males.

As we mentioned in the “Introduction,” this is a follow-up of a previous study where we confirmed that a continuous HFD administration during cocaine withdrawal undermined reinstatement of cocaine-induced CPP, thereby acting as an alternative reward. However, that administration pattern induced several metabolic consequences (Blanco-Gandía et al. 2017a). Therefore, the aim of the present study was to evaluate a reduced administration pattern of HFD that could have effects on cocaine reward without inducing metabolic disturbances, such as increased bodyweight or changes in circulating leptin or ghrelin levels. We have shown that the metabolic harm induced by HFD is not necessary to disrupt the extinction and reinstatement of the CPP induced by cocaine, given that a limited and intermittent exposure was sufficient to block cocaine-related effects in a sex-specific manner. Although the HFD did not modify these parameters or affect the Oprμ gene expression, we found changes in the CB1r gene expression in the striatum of male animals. Surprisingly, we observed that only males that showed a faster extinction of cocaine-related memories (HFD-Ext) exhibited an overexpression in CB1r.

Conditioned rewarding effects of cocaine

Most of the studies performed to date focus on the role of palatable food on the acquisition of the self-administration/CPP of drugs (Davis et al. 2008; Morales et al. 2012; Blanco-Gandía et al. 2017a, 2017b). Prevention of the reinstatement of cocaine self-administration in rats has been obtained by pairing cocaine-related stimuli with food (Kearns and Weiss 2007). In that study, one of the groups was submitted to extinction by pairing a tone with the presentation of food in a different context. When this group was exposed to the tone in the original context, their renewal was significantly lower than that observed in the group exposed to the tone but without alternative reward to cocaine. Kearns and Weiss (2007) suggested that pairing a drug-related stimulus, such as a tone, to another reward different from the drug enhances the reduction of the reinstatement into drug seeking and prevents relapse. To date, no data regarding food administration before being exposed to a drug-related context are available. In our study, we evaluated for the first time how intermittent access to HFD may modulate the extinction/reinstatement process.

We have previously shown that ad libitum access to a HFD accelerated the extinction process and blocked the reinstatement of cocaine seeking in adult mice (Blanco-Gandía et al. 2017a). However, an important weakness of that study was that continuous access to HFD induced bodyweight gain and dysregulation of hormone levels, with an increase in leptin and a decrease in ghrelin signaling, pointing to a metabolic syndrome and discarding the possibility of employing palatable food as an alternative reward to cocaine. However, in the present study, none of the intermittent and limited access to HFD schedules employed affected bodyweight, confirming that limited access to a HFD does not promote obesity (Corwin et al. 1998; Hudson et al. 2007; Blanco-Gandía et al. 2017b). In addition, there were no changes in leptin or ghrelin signaling in any of the groups, independently of HFD administration. These results confirmed previous studies (Blanco-Gandía et al. 2017b), showing that intermittent access to palatable food does not induce the negative consequences that were observed after ad libitum access (Davis et al. 2008; Morales et al. 2012; Blanco-Gandía et al. 2017a).

In agreement with previous reports indicating sex differences in the response to cocaine, we have observed a marked sex difference in the establishment of cocaine-induced CPP. Female mice spent more time in the drug-paired compartment during Post-C than males, suggesting a stronger sensitivity to the rewarding effects of cocaine. In line with our results, female rodents exhibit an increased sensitization to the locomotor effects of cocaine (Holly et al. 2012), faster acquisition of cocaine self-administration with persistence in the progressive ratio schedule (Martini et al. 2014; Lynch 2006), and develop cocaine-induced CPP with less pairing sessions than males (Russo et al. 2010).

However, after Post-C, female mice took less time to extinguish the preference than males, which suggests that cocaine-related memories are stronger in males. These results are in the line with previous studies showing that female mice require fewer extinction sessions than males to extinguish the CPP (Hilderbrand and Lasek 2014) and that estradiol administration facilitates extinction of cocaine CPP in female rats (Twining et al. 2013). However, no positive effects of the diet on the extinction process were observed in females, conversely to the results in males. Male mice exposed to HFD prior to each extinction session (HFD-Ext) exhibited a faster extinction of the drug-related memories. The HFD-Ext male group required significantly fewer sessions than the control group as well as fewer sessions than the HFD-MWF and HFD-24h groups. This effect could be related to the administration of HFD before every exposure to the context associated with cocaine reward. This is supported by previous data, where it has been suggested that pairing food with the old contextual cues related to the drug can prevent relapse (Kearns and Weiss 2007). An additional explanation is that the more rapid extinction in the HFD-Ext male group could be due to satiety and not be specific to HFD, as some studies have found that caloric restriction lowers the threshold dose for cocaine CPP and increases the persistence of extinction (Zheng et al. 2012; Jung et al. 2016). Thus, we cannot rule out that satiation in this group modulates the extinction process.

Hence, it can be argued that in order to diminish drug seeking in male mice, the moment in which HFD is administered plays a key role. Extinction recruits a new learning process (Lattal et al. 2006; Nic Dhonnchadha et al. 2013), especially in the hippocampus, which is particularly involved in the extinction of drug-associated memories (Szalay et al. 2013). Exposure to fat before each extinction session became a stimulus that predicted a non-reinforced context without cocaine and therefore accelerated the extinction of the preference. This effect is clear in males (HFD-Ext), but not in females, maybe due to their faster extinction of the preference. It is important to note that several studies have shown that HFDs lead to impairments in cognitive function such as memory or learning (Hwang et al. 2010; Valladolid-Acebes et al. 2013). In a previous study from our laboratory, we evaluated if a limited and intermittent access (exposed to the same conditions as the HFD-MWF group in this study) produced comparable impairments as the continuous access to a HFD (Blanco-Gandía et al. 2019). Our results showed that animals exposed to a limited and intermittent access to HFD showed no differences with respect to animals fed with a standard diet, which supports that the data obtained in the male HFD-Ext group are not due to a recall impairment.

As expected, 5 mg/kg of cocaine induced reinstatement in male and female mice fed with the standard diet. All three of the HFD administrations were effective in males, which did not reinstate their preference after a priming dose of cocaine. With a similar or lower number of extinction sessions than the control group, none of the male groups exposed to HFD reinstated the preference, suggesting that intermittent and limited HFD administration is a good alternative reinforcer. Therefore, the fact that the HFD blocked the reinstatement of preference in males, in all the schedules employed, independently of its administration schedule, suggests that fat could also act as an alternative reinforcer competing with cocaine.

Different results were observed in female mice, with only one group not reinstating the preference with 5 mg/kg cocaine, the female HFD-MWF group. The sex differences found in the response to HFD could be explained through the fact that female rats exhibited a higher cocaine priming-induced reinstatement response than males (Lynch 2006) with a greater magnitude of reinstatement to cocaine-induced CPP (Bobzean et al. 2010). Several studies have suggested that the neural systems mediating cocaine reinforcement could also show sex differences, with dopamine response induced by cocaine in several brain areas being greater in female (Becker 1999; Becker and Ramirez 1981; Walker et al. 2001). Results even showed that this higher dopaminergic sensitivity in females could be independent of gonadal hormones (Bazzett and Becker 1994; Castner et al. 1993; McDermott et al. 1994). However, the remaining question is why only the HFD-MWF pattern was a protective pattern to reinstate cocaine preference in females. Firstly, it was only in this group where females ate significantly more fatty food. Secondly, we have previously shown that administration for several weeks is a risky pattern that induces neurobiological changes similar to those produced by chronic drug administration and could interfere in the reward system (Blanco-Gandía et al. 2017a; Corwin et al. 1998; Puhl et al. 2011). We can hypothesize that HFD-MWF females were protected from reinstatement into cocaine seeking because they may have developed another preference for HFD. Some studies have reported that, after drug withdrawal, there is increased overeating, and it is even recommended to counteract craving (Bane et al. 1993; Orsini et al. 2014). In this context, authors propose the concept of “addiction transfer,” where one addiction is replaced by another, and could explain the behavioral outcomes exhibited by the HFD-MWF female group (Chechlacz et al. 2009).

Based on our results, we hypothesize that the administration of a natural reward, such as food, was not enough to block the potent effect of cocaine on female mice. However, in the case of males, who acquired the preference with less intensity than females, HFD became a good alternative reinforcer. Although preclinical studies are limited, several reports show that drug withdrawal induces an increase in food consumption. Orsini et al. (2014) showed that chronic exposure to amphetamine (9 injections) increased food consumption in male rats after cessation, discarding the possibility of a rebound from amphetamine-induced anorexia, as all the animals, control and amphetamine-treated, weighted the same when withdrawal began. In the same line, there is a reduction in the rewarding properties of drugs when a HFD is administered. For example, Wellman et al. (2007) demonstrated that a free access to HFD for 45 days diminished the acquisition of cocaine self-administration in male rats. Other findings on female rats showed that, after a 14-day exposure to cocaine, a specific increase in fat and carbohydrate consumption occurred, which was not seen in protein consumption (Bane et al. 1993). Most of these studies have been performed in male rodents, and our results highlighted the necessity of studying the response to palatable food in females as well as the limitation of the abovementioned results.

The different schedules of intermittent accesses to the HFD caused animals to receive a different number of rewarding experiences, which can be considered a limitation of this study. Those groups needing more sessions to extinguish the preference consequently were exposed to a higher number of fat administrations. However, a lack of reinstatement did not correlate to the number of HFD sessions in male or female mice. In addition, another important limitation to take into account in future studies is the variety of manipulations per week during the extinction process, given that, during this period, the SD groups were only moved to perform the extinction sessions, while the HFD groups were also moved for HFD access.

Neurobiology changes induced by intermittent and limited access to HFD

The endocannabinoid and the endogenous opioid systems are crucial in the addiction process and regulate feeding behaviors (Kessler et al. 2016; Bello et al. 2014). Opioid signaling is closely related to the rewarding properties of food, regulating palatability (Esch and Stefano 2004). For example, a continuous access to a HFD induces a significant reduction in Oprμ gene expression in the VTA (Blendy et al. 2005; Vucetic et al. 2011). Moreover, some studies point that CB1 activation in the NAcc and VTA modulates both dopaminergic and opioidergic pathways (Mellis et al. 2007). CB1 receptor antagonists are capable of reducing binge eating and mediate the extracellular dopamine release produced by a HFD (Parylak et al. 2012; Mellis et al. 2007). All these results support the idea that intermittent fat ingestion can modify reward pathways through different systems such as the opioid and cannabinoid systems. While the endogenous opioid system is related to the hedonic properties of food and modulates the release of DA-anticipating food, the endocannabinoid system is related to the homeostatic control of intake and positive feedback on the specific intake of fatty food (Koch 2001). In contrast with the numerous changes in gene expression induced by ad libitum administration of HFD (Blanco-Gandía et al. 2017a), the intermittent and the limited access to HFD during the extinction process practically did not induce variations. HFD administration did not induce any changes in the Oprμ gene expression, with the exception of a higher expression in males with respect to females. Results from previous studies are still controversial and depend on many factors, as some studies have reported that intermittent access to HFD for several weeks decreases mRNA expression of the Oprμ receptor in the NAcc in male mice (Blanco-Gandía et al. 2017b), while others reported the same reduction after a continuous access to a HFD or cafeteria diet in male mice (Ong et al. 2013; Vucetic et al. 2011) with no changes in females (Ong et al. 2013). The present results do not confirm that HFD administration changes Oprm gene expression, since we only found differences between sexes but not due to HFD intake. A previous study by Kawahara et al. (2013) showed that consumption of palatable food increased DA release in the NAcc via activation of the Oprm pathway in the VTA, and it is well known that this projection has an influence on learning or performance of behaviors based on drug reward (Koob and Le Moal 2005). Regarding CPP studies, the Oprm antagonist naloxone was effective in blocking the reacquisition of a cocaine-induced CPP (Sticht et al. 2010). Although there are very few studies that focus on this relationship, this could account for the difference between males and females, indicating that male mice are modifying their learning processes during extinctions more efficiently than females.

Although HFD did not induce any changes in CB1r gene expression in females, an increase was detected in the HFD-Ext male group. Some studies have reported that HFD upregulates endocannabinoid levels (Massa et al. 2010; Higuchi et al. 2012) and that CB1 antagonists reduce binge eating (Parylak et al. 2012). We have only observed this upregulation in the HFD-Ext group, which could be related to the time required to extinguish the preference, and thus, males in this group could have quickly learned to notice the absence of the conditioned drug in the CPP. This would support that CB1 receptors are modulating the extinction and reinstatement processes, as previous studies show that CB1 antagonists such as rimonabant or AM251 are able to block cocaine and morphine extinction and CPP reinstatement (Yu et al. 2011; Khaleghzadeh-Ahangar and Haghparast 2015). A limitation of the present work is in the analysis of the complete striatum without differentiating the ventral from the dorsal part, which could have offered more specific changes.

Conclusion

The results of the present work support our initial hypothesis, showing that palatable food could act as an alternative reward to cocaine by decreasing its conditioned rewarding memories and blocking cocaine priming-induced reinstatement. These effects are conditioned by sex, as females were less sensitive to the protective action of HFD. Taken together, the present results point out that a sporadic access to HFD stimulates the same pathways activated by drugs, as well as decreasing craving and drug-related memories.

Currently, no controlled studies in humans have been performed to test the role of palatable food in attenuating drug withdrawal. It is well known that palatable food is generally used as self-medication to escape from a negative mood state or stressful situations (Groesz et al. 2012; Ifland et al. 2009; Kim et al. 2013). In 1989, Hatcher reported that substances such as sugar, aspartame, chocolate, and nutritional supplements were compulsively used by patients in rehabilitation centers (Hatcher 1989). Several reports highlighted that during drug withdrawal and abstinence, overeating is a well-known problem in rehabilitation centers (Edge and Gold 2011). For example, Cowan and Devine (2008) found that patients at different stages of recovery from substance addictions experienced an increase in food intake, weight changes, and used food in recovery. Overeating and post-addiction obesity are so common that most abstinence-oriented drug treatment programs schedule diet counseling and mandatory exercise programs (Cowan and Devine 2008). Our results support these clinical reports and suggest that, acting as an alternative reward, palatable food may attenuate drug craving and the vulnerability to relapse. In particular, a more effective reduction in the extinction of drug-related memories occurs when a HFD is consumed temporally close to a drug-related cue exposure.

Contrary to an ad libitum access, our feeding conditions do not have significant consequences on bodyweight or hormonal levels, with minimal changes in opioid and cannabinoid receptors gene expression. These results indicate that it would be beneficial to introduce the intermittent use of palatable food as a “rehabilitation tool” in drug treatment programs, allowing patients to eat it when drug craving is happening. The translational value of this study relies on the lack of metabolic adverse effects of any of the three HFD administrations. However, and no less important, this treatment was significantly less effective in females, which indicates the necessity of conducting more studies in females, since we cannot assume that environmental or nutritional interventions are equally effective in both sexes.