Introduction

Molecular, neuroanatomical, and neurophysiological studies have demonstrated mechanistic similarities between normal forms of learning and memory and central actions of reinforcing drugs; drugs of abuse and Pavlovian and instrumental learning processes act on similar neural pathways in the mesocorticolimbic brain reward system (Beninger and Gerdjikov 2004; Everitt and Robbins 2005; Hyman et al. 2006; Jentsch and Taylor 1999; Stewart 2004; White 1996). Repeated administration of reinforcing drugs, such as cocaine or morphine produces functionally significant alterations in neurobiological processes subserving reward-related learning and memory (Harris et al. 2004; Hyman et al. 2006; Olson et al. 2005; Schoenbaum et al. 2006; Taylor and Horger 1999; Taylor and Jentsch 2001; Valjent et al. 2006). These drug-induced neural alterations in reward-related learning and memory processes may result in enhanced control over drug-seeking behavior by reward-associated contextual stimuli (Berke 2003; Hyman 2005; Self 2004). Enhanced control over drug-seeking behavior by reward-related stimuli may in turn increase the vulnerability of drug users to relapse into drug use upon exposure to drug-associated contexts, such as places paired with their prior drug use (Ehrman et al. 1992; Gawin and Kleber 1986).

Repeated prior exposure to psychostimulants and psychomotor-stimulant induced sensitization enhance the acquisition of appetitive Pavlovian learning and the expression of Pavlovian-cue induced operant behavior (Pavlovian-to-instrumental transfer) when sensitized rats are tested in a drug-free state (Harmer and Phillips 1998; Taylor and Jentsch 2001; Wyvell and Berridge 2001). Moreover, accumbens amphetamine injections potentiates conditioned-reward-directed behaviors in rats tested during acute exposure to this drug (Phillips et al. 2003; Taylor and Robbins 1984; Wyvell and Berridge 2000). However, while previous studies have investigated the effects of acute or repeated psychostimulant drug exposure on the performance of conditioned-reward-directed behaviors, no studies have examined whether previous repeated exposure to conditioned drug rewards (for example the drug-paired context) enhances subsequent learning. An interesting question regarding drug-paired contexts is whether repeated prior pairings of a drug with a certain context can enhance subsequent spatial learning and memory processes.

After repeated pairings with drugs, the contextual environment acquires secondary reinforcing properties (conditioned reward) through Pavlovian learning (Schechter and Calcagnetti 1993). Through Pavlovian learning, rats will choose to spend more time in a location in which they have passively received an injection of psychostimulants or morphine than in another location paired with saline injection (conditioned place preference–CPP; Bardo and Bevins 2000; Tzschentke 1998). Previous exposure to drug-paired contexts through CPP enhances subsequent drug self-administration and reinstatement of drug-seeking behavior (Bardo and Bevins, 2000; Itzhak and Martin 2002). We hypothesized that CPP would facilitate subsequent spatial learning. To that effect, we investigated the effect of morphine- or cocaine-induced CPP on rat spatial learning using the Morris water maze test. Groups of rats acquired morphine (10 mg/kg), cocaine (10 mg/kg), oral sucrose (15%) CPP. Another group of morphine-dependent rats acquired conditioned place aversion (CPA) to a context paired with precipitated opiate withdrawal induced by naloxone injections (1 mg/kg). The sucrose CPP and the morphine CPA groups were included to assess the generality of the effect of our training procedure on subsequent performance and to examine roles of different cue stimuli (rewarding or aversive) in a spatial learning task. To examine the role of conditioned reward or aversion in spatial learning, rats were then exposed to the previous morphine-, cocaine-, sucrose-, or naloxone-paired context for 10 min before training of spatial learning in the Morris water maze.

Materials and methods

Subjects and drugs

One hundred seven male Sprague–Dawley rats, weighing 220–270 g, were housed individually (experiment 3) or in groups of four (experiment 1–2 and 4) under a constant temperature (23 ± 2°C) and maintained on a 12-h light/dark cycle with free access to food and water except in the case of rats in the sucrose training paradigm. All the treatments of the rats were performed in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals (NIH publication no.86–23, 1996). The procedures were approved by the local Committee of Animal Use and Protection. The drugs used were cocaine hydrochloride and morphine hydrochloride (Qinghai Pharmaceutical, China). The naloxone hydrochloride was purchased from Sigma (SI, USA). Cocaine, morphine, and naloxone were dissolved in saline.

Training for morphine and cocaine CPP

The apparatus for conditioned place preference training consisted of six identical three-chamber PVC boxes (Med Associates, USA) in light- and sound-controlled cubes. Two large chambers (27.9 cm long × 21.0 cm wide × 20.9 high cm) were separated by a smaller chamber (12.1 cm long × 21.0 cm wide × 20.9 cm high), with differences in wall color and floor texture. One large chamber was black with a floor of stainless-steel bars (4.8 mm placed on 1.6-cm centers). The other large chamber was white with a floor of a stainless-steel mesh (1.3 × 1.3 cm). The middle smaller chamber was gray with a smooth PVC floor. Three distinct chambers were separated by manual guillotine doors. According to previously study, rats had a significant natural preference for the black chamber over the white chamber, and bias could be neutralized by manipulating the ambient lighting within the CPP procedure (Roma and Riley 2005). The intensity of ambient illumination was adjusted to balance the side preferences for each large chamber. The conditioning sessions were performed in the light phase in the presence of white noise. To determine baseline preference (preconditioning), the rats were initially placed in the middle chamber with the doors removed for a period of 15 min. The time spent in the previously saline- or drug-paired chambers during the 15-min session was determined using MED-PC system (Med Associates, USA). Conditioning was performed using an unbiased, balanced protocol as in our previous studies (Lu et al. 2001; Wang et al. 2006). About 10% total subjects (13/120), which spent 150 s in one large chamber more than in another, were discarded.

Morphine and cocaine CPP training sessions consisted of 8 alternating days of drug and saline injections with the guillotine doors in place (Lu et al. 2000; Wang et al. 2006). The chamber in which morphine or cocaine was administered was assigned randomly. Thus, each rat was treated for 8 consecutive days with alternate injections by drugs (morphine 10 mg/kg, s.c.; cocaine 10 mg/kg, i.p.) and saline (1 ml/kg). The drug doses selected for CPP conditionings were based on our and other previous studies (Bardo et al. 1995; Lu et al. 2001; Wang et al. 2006). The most effective morphine and cocaine doses during acquisition of CPP were 10 mg/kg morphine, s.c. or 10 mg/kg cocaine, i.p. respectively (Bardo et al. 1995; Lu et al. 2001; Tzschentke 1998; Wang et al. 2006). After each injection, the rats were immediately confined to the conditioning chamber for 45 min before they returned to the home cage. On day 9, the rats were tested for the acquisition (postconditioning) of morphine or cocaine CPP under conditions identical to those described for the preconditioning test: The doors were removed and the rats were placed in the middle chamber and allowed to move freely for 15 min.

Training for sucrose CPP

The procedure for training sucrose CPP was modified from previous studies (Alderson et al. 2001; Delamater et al. 2000). Briefly, rats were individually housed with free access to drinking water but limited access to 10 g lab chow per day for 8 days before training. Under this rearing condition, rats maintained 80% body weight. The cover of the two large chambers of the CPP box was modified for liquid supply through a drinking tube. Before the preconditioning test and the training session, drinking water was removed from the home cage for 2 h. Preconditioning test was performed as described above except that plain water was available on both side. Sucrose CPP training consisted of 14 alternating days of sucrose and plain water available with the guillotine doors in place. During training for CPP, rats were allowed free access alternately to sucrose solution (15%) and water for 30 min through a tube. The chamber in which sucrose was delivered was assigned randomly. Our preliminary experiment demonstrated that this procedure produced the largest magnitude of sucrose CPP in our rats. On day 15, the rats were tested for 15 min for the acquisition of sucrose CPP under conditions identical to the preconditioning test.

Training for naloxone-conditioned place aversion

The procedure of naloxone-precipitated withdrawal-induced CPA in morphine-dependent rats was based on previous studies (Lu et al. 2000, 2005; Mucha 1987; Salem and Hope 1997; Watanabe et al. 2002, 2003). Rats received morphine s.c. injections twice a day for 7 days by an increasing dose regimen of 10, 15, 20, 25, 30, 35 or 40 mg/kg, respectively (Lu et al. 2005). From days 8 to 14, rats were given injections of morphine 40 mg/kg twice a day with a 12-h interval between injections (8:00 a.m. and 8:00 p.m.). On day 8, rats were tested for their preconditioning CPA at 2–4 h before the second morphine injection. Naloxone-conditioned place aversion training started on day 9. Conditioning was performed in six alternate sessions of naloxone and saline treatments. Thirty minutes after first injection of morphine 40 mg/kg (s.c.), rats were injected with naloxone 1 mg/kg (i.p.) or saline (1 ml/kg). After each injection of naloxone or saline, the rats were immediately confined to the conditioning chamber for 30 min before they returned to the home cage. On day 15, the rats were tested for the acquisition of naloxone CPA under conditions identical to those described for the CPP test.

Place navigation task

Rats were habituated to the swimming pool environment the day after the last CPP test. Two days after the last CPP test, a place navigation task was conducted using the Morris water maze (Morris et al. 1982, 1986). The apparatus consisted of a circular black-painted swimming pool (150 cm in diameter × 50 cm high) filled with water at 22–24°C and made opaque by the addition of black ink. The pool was located in a testing room containing numerous objects on the wall that the animals can use as cues for spatial orientation. The position of these environmental cues remained unchanged throughout the entire testing period. The pool was divided into four equal quadrants. The black platform was invisible by making the circular of Morris water maze black and the water opaque by addition of black ink. The invisible black platform (9 cm in diameter and submerged 1.5 cm below the water surface) was placed in the center of the quadrant I. On day 1, the rats were habituated to the swimming pool environment. They were placed individually on the platform for 20 s and in the pool for 60 s and allowed to swim. During the training period of the task, rats were given three trials per day to find the hidden platform for 5 consecutive days (maximum trial duration 60 s, 20-s reinforcement on the platform). Each rat was gently placed into the water, with the nose pointing toward the wall at one of the starting points designated on the rim center of quadrants II, III, and IV. Rats that failed to find the hidden platform were given a score of 60 s and then physically placed on the platform for 20 s. The time required for rats to climb onto the platform was recorded as escape latency. The mean of the three trials per training day was scored.

Experimental design

As described above, each experiment consisted of three phases: training for CPP (or CPA), test for expression of CPP (or CPA), and training for spatial learning in the Morris water maze. Experiments 1–3 examined the effects of drugs (experiments 1 and 2) or sucrose (experiment 3) conditioned reward on spatial learning and memory. After testing for acquisition of CPP for morphine, cocaine, or sucrose, rats were randomly assigned to three groups (conditioned reward, non-conditioned reward and control group, n = 8–10 per group) and subsequently tested for spatial learning in the Morris water maze task, as described above. During the testing session, separate groups of rats were given different treatments: (1) rats in the conditioned reward group were placed in the previously morphine-, cocaine- or sucrose-paired chamber for 10 min each day before placement in the water maze; (2) rats in the non-conditioned reward group were placed in the previously saline- or water-paired chamber for 10 min each day before placement in the water maze; (3) rats in the control group (prior reward-experience condition) were not exposed to any CPP chamber before placement in the water maze.

Experiment 4 examined the effects of conditioned aversion on spatial learning and memory. Rats were trained on acquisition of naloxone CPA as described above. Subsequently, rats were randomly assigned to three groups (conditioned aversion, non-conditioned aversion, and control group, n = 8 per group) and subsequently tested for spatial learning in the water maze task, as described above. During the test for spatial learning, separate groups of rats were given different treatments: (1) rats in the conditioned aversion group were placed in the previously naloxone-paired chamber for 10 min each day before placement in water maze; I2) rats in the non-conditioned aversion group were placed in the previously saline-paired chamber for 10 min each day before placement in the water maze; (3) rats in the control group (prior precipitated withdrawal condition) were not exposed to any chamber prior to placement in the water maze.

Statistical analysis

The amount of time spent in drug- or sucrose-paired sides, the scores of place preference or aversion, and the time of escape latency for finding the platform during the water maze test were expressed as means ± SEM. Data were analyzed using a mixed factorial design with treatment condition (conditioned reward, non-conditioned reward, and control) as the between-subjects factor and test day (preconditioning and postconditioning in CPP or CPA test and days 1–5 in test of water maze) as the within-subjects factor. A subsequent one-way analysis of variance (ANOVA) was used to reveal the effect of treatment condition at each test day. A post-hoc analysis using the Student–Newman–Keuls test was used to evaluate differences among treatment conditions in time spent in the drug- or sucrose-paired side or in escape latency for finding the platform. Statistical differences at P < 0.05 were considered significant.

Results

Effects of morphine conditioned reward on spatial learning

Figure 1a shows the time spent in the morphine-paired chamber during the preconditioning and postconditioning of CPP. Overall, as in our previous studies (Lu et al. 2000, 2001), rats in the different groups [conditioned reward, non-conditioned reward and control (prior reward-experience)] acquired morphine CPP after 8-day alternate injections of morphine and saline. As no significant differences were found on time spent in the morphine-paired chamber among the three groups during the preconditioning or postconditioning test session (P > 0.05), the data from three groups were combined. The one-way ANOVA analysis revealed a significant increase in the time spent in the morphine-paired chamber from the pre- to post-conditioning place preference tests [F (1, 29) = 49.1, P < 0.05].

Fig. 1
figure 1

Effects of morphine conditioned reward on spatial learning. a Acquisition of morphine CPP. Data are depicted as the time (mean ± SEM) spent in the morphine-paired chamber during the preconditioning and postconditioning of CPP. Different from preconditioning, *P < 0.05 (total n = 30, 3 groups). b Effects of exposure to previous morphine-paired chamber on spatial learning. Data are depicted as the time (mean ± sem) spent in finding the hidden platform. Different from the control group, *P < 0.05, n = 9–11 per group

Figure 1b shows the time spent in finding and climbing onto the hidden platform. Rats that were exposed to the previously morphine-paired side spent significantly less time to find the hidden platform, which indicated that exposure to a conditioned reward increased spatial learning and memory. Data analysis revealed significant effects of treatment condition [F (2, 27) = 16.6, P < 0.05), test day [F (4, 108) = 46.1, P < 0.05] and treatment condition × test day [F (8, 108) = 12.7, P < 0.05). However, no significant differences in swimming speed (14.5, 13.9, and 14.1 cm/s, for conditioning reward, non-conditioning reward, and control group, respectively) were observed among different groups (P > 0.05). Post-hoc group differences are indicated in Fig. 1b.

Effects of cocaine conditioned reward on spatial learning

Figure 2a shows the time spent in the cocaine-paired chamber during the preconditioning and postconditioning of CPP. As in experiment 1, rats in the different groups (conditioned reward, non-conditioned reward, and control) acquired cocaine CPP after an 8-day alternate injections of cocaine and saline. As no significant differences were found on time spent in the cocaine-paired chamber among the three groups during the preconditioning or postconditioning test session (P > 0.05), the data from three groups were combined. One-way ANOVA analysis revealed a significant increase in the time spent in the cocaine-paired chamber from the pre- to post-conditioning place preference tests [F (1, 28) = 22.5, P < 0.05].

Fig. 2
figure 2

Effects of cocaine conditioned reward on spatial learning. a Acquisition of cocaine CPP. Data are depicted as the time (mean ± sem) spent in the cocaine-paired chamber during the pre-conditioning and post-conditioning of CPP. Different from pre-conditioning, *P < 0.05 (total n = 29, 3 groups). b Effects of exposure to previous cocaine-paired chamber on spatial learning. Data are depicted as the time (mean ± SEM) spent in finding the hidden platform. Different from the control group, *P < 0.05, n = 9–10 per group

Figure 2b shows the time spent in finding and climbing onto the hidden platform. Rats that were exposed to the previously cocaine-paired side spent significantly less time to find the hidden platform, which indicated that exposure to cocaine conditioned reward increased spatial learning and memory. Data analysis revealed significant effects of treatment condition [F (2, 26) = 38.8, P < 0.05), test day [F (4,104) = 65.8, P < 0.05) and treatment condition × test day [F (8, 104) = 15.4, P < 0.05). However, no significant differences in swimming speed (15.1, 14.7, and 15.3 cm/s, for conditioning reward, non-conditioning reward, and control group, respectively) were observed among different groups (P > 0.05). Post-hoc group differences are indicated in Fig. 2b.

Effects of sucrose conditioned reward on spatial learning

As in experiment 1, rats in the different groups (conditioned reward, non-conditioned reward, and control) acquired sucrose CPP after 14 days of alternate access to 15% sucrose solution and water. As no significant differences were found in time spent in the sucrose-paired chamber among the three groups during the preconditioning or postconditioning test session (P > 0.05), the data from the three groups were combined. The one-way ANOVA showed that there was a significant increase in the time spent in the sucrose-paired chamber from the pre- to post-conditioning place preference tests [F (1, 23) = 10.0, P < 0.05].

Rats that were exposed to the previously sucrose-paired side spent a similar time to find the hidden platform as the control and non-conditioned reward groups, indicating that previous exposure to sucrose conditioned reward does not affect spatial learning and memory. Data analysis revealed significant effects of test day [F (4, 84) = 15.0, P < 0.05) but not for treatment condition or treatment condition × test day (P > 0.05).

Effects of naloxone-conditioned aversion on spatial learning

Rats in the different groups (conditioned aversion, non-conditioned aversion, and control) acquired naloxone CPA after 6 days of alternate injections of naloxone and saline after chronic morphine administration. As no significant differences were found in time spent in the naloxone-paired chamber among the three groups during the preconditioning or postconditioning test session (P > 0.05), the data from the three groups were combined. The one-way ANOVA revealed a significant decrease in the time spent in the naloxone-paired chamber from the pre- to post-conditioning place preference tests [F (1, 23) = 28.9, P < 0.05].

No significant differences in time spent to find the hidden platform were found among groups, indicating that exposures to drug-conditioned aversion do not affect spatial learning and memory. Data analysis revealed significant effects for test day [F (4, 84) = 32.5, P < 0.05] but not for treatment condition or treatment condition × test day (P > 0.05).

Discussion

The present study examined the effects of cocaine- and morphine-conditioned reward on spatial learning and memory, as assessed with the Morris water maze task. The main finding in this report was that after the acquisition of CPP, rats exposed to a context previously paired with morphine or cocaine but not a context previously paired with sucrose or naloxone-precipitated withdrawal, spent significantly less time to find a hidden platform in the Morris water maze, relative to rats not previously exposed to a drug-paired context.

Contextual cues that are associated with morphine or psychostimulant availability can promote psychomotor sensitization (Badiani et al. 2000; Crombag et al. 2000). In our present study, no significant differences in swimming speed were observed among pre-exposed to the drug (cocaine or morphine) paired-, unpaired-context, and control groups. Thus, it is unlikely that rats pre-exposed to the cocaine or morphine-paired context have exhibited enhanced performance in the Morris water maze task due to the expression of psychomotor sensitization induced by the drug-paired context.

Previous studies have shown that prior exposure to drugs of abuse enhances the learning of appetitive Pavlovian-conditioned responses and spatial learning as assessed by the Morris water maze task in animals tested in a drug-free state. Repeated intermittent exposure to psychomotor stimulant drugs leads to enhanced acquisition of Pavlovian conditioning or of Pavlovian-cue-directed operant behaviors in animals tested in a drug-free state (Harmer and Phillips 1998, 1999; Taylor and Jentsch 2001; Wyvell and Berridge 2001). Previous exposure to morphine enhances spatial learning as assessed with the Morris water maze task (Yang et al. 2004). However, no studies have assessed the effects of exposure to a drug-paired context on spatial learning. The results of the present study provide the first demonstration that conditioned -drug reward promotes spatial learning and memory in rats.

Cocaine- or morphine-conditioned reward, but not sucrose conditioned reward, may have enhanced subsequent spatial learning and memory in the Morris water maze for several reasons. It has demonstrated that morphine or cocaine, but not sucrose, may have sensitized brain areas involved in spatial learning rendering them hypersensitive to spatial or contextual-stimuli predicting reinforcement. For example, repeated daily administration of cocaine has been shown to induce functional changes in voltage-dependent sodium channels in rat hippocampal CA1 pyramidal neurons, increasing the excitability of those neurons (Zhai et al. 1997). Cocaine administration increases the excitability of Shaffer collateral synapses of CA1 pyramidal neurons in the hippocampus, which can enhance processing of neuronal signals through hippocampal circuitry (Hammad and Wagner 2006) that is involved in spatial learning (Foster et al. 1989; McNaughton et al. 1983; O'Keefe 1983). Mu-opioid receptor agonists also increase the excitability of hippocampal neurons after electrical stimulation of synaptic afferents to the dentate gyrus and CA1 regions of the rat hippocampus; mu-opioid receptor induced suppression of gamma-aminobutyric acid (GABA) release onto GABA-A receptors facilitates excitatory activity of CA1 hippocampal neurons (McQuiston and Saggau 2003; Neumaier and Chavkin 1986). Interestingly, hippocampal circuitry that is necessary for spatial learning (Morris et al. 1990; Moser et al. 1993) is also necessary for both the acquisition and expression of established morphine and cocaine CPP (Meyers et al. 2006; Milekic et al. 2006). Thus, the hippocampal formation is important for directing approach behaviors elicited by cocaine or morphine contextual stimuli and also for using salient spatial stimuli to guide exploratory behavior necessary to attain reinforcement in navigation tasks. As cocaine- or morphine-conditioned stimuli induce a conditioned enhancement of brain reward function (Bespalov and Zvartau 1992; Kenny et al. 2003) and morphine and cocaine increase the excitability of hippocampal circuitry that is necessary for spatial learning (see above), previous cocaine or morphine context may facilitate spatial learning by sensitizing hippocampal circuitry necessary for spatial learning, increasing its sensitivity to spatial information predictive of reinforcement. In contrast, there is no evidence, to our knowledge, suggesting that sucrose reward alters the brain circuitry required for spatial learning in the same manner as drugs of abuse.

The other reason for different roles of drug-conditioned reward and sucrose conditioned reward in spatial learning may be due to different neuronal processes in mesolimbic system induced by natural reward-related cues vs cue associated with drugs of abuse. It has been reported that animals were trained to administer heroin associated with a compound audio-visual cue; re-exposure to the cue after 3 weeks of withdrawal reinstated heroin-seeking behavior, which resulted in immediate early gene (IEG) expression of ania-3 (a member of the Homer family), Mitogen-activated protein (MKP)-1, c-fos and nuclear receptor subfamily 4, group A, member 3 (Nr4a3) in the medial prefrontal cortex (mPFC), and of ania-3 in the orbital frontal cortex (OFC) and nucleus accumbens core (NAc). The activation patterns for heroin-seeking behaviors did not generalize to sucrose-seeking behaviors, indicating that the two behaviors involve different connectivity pathways of neuronal signaling (Koya et al. 2006). Thus, drugs of abuse, but not sucrose, may induce sensitization of hippocampal circuitry required for spatial learning accounting for the differences found in the present study wherein cocaine or morphine conditioned reward, but not sucrose conditioned reward, facilitated subsequent spatial learning and memory.

Another question raised by the present study is why conditioned reward, but not conditioned aversion, facilitates subsequent spatial learning and memory. Previous studies have shown that when drug reinforcement has been repeatedly paired with drug-associated cues, these conditioned cues can elicit Pavlovian approach responses and preference for the drug-paired environment (Bardo and Bevins 2000). In preclinical studies, stimuli that are associated with reinforcement and increased dopamine release in the mesolimbic system, such as intracranial self-stimulation (ICSS), have been shown to enhance the performance of a variety of learned responses including operant and spatial learning (Aldavert-Vera et al. 1996; Segura-Torres et al. 1988, 1991; Yoganarasimha et al. 1998). Stimulant drugs such as amphetamine and cocaine (as well as others), but not aversive stimuli such as those associated with opiate withdrawal, exaggerate the incentive salience of conditioned stimuli that are closely associated in space and time with self-administered drugs (Robinson and Berridge 1993). We therefore propose that exposure to the drug-paired environment acting as a conditioned reward may elicit incentive salience associated with reinforcement that could enhance the learning or retention of a spatial-navigation task. This hypothesis is supported by our current results. Rats that were pre-exposed to the aversive naloxone-paired and drug-unpaired side did not exhibit improved spatial learning and memory, unlike the rats pre-exposed to a morphine- or cocaine-paired environment that exhibited an enhancement of this behavior.

In summary, the present study investigated the effect of conditioned drug reward on spatial learning and memory using the CPP paradigm and the Morris water maze task. Our results suggest that acute exposure to the morphine or cocaine-paired context enhances subsequent spatial learning and memory.