Introduction

Impoverishment of access to conventional sources of reinforcement may be a risk factor for substance abuse, particularly in lower socioeconomic populations (Richman 1977; Wall et al. 2011), possibly because drugs eventually come to outcompete available naturalistic rewards. Human substance dependence is a minority outcome within the population exposed to a given substance (Anthony et al. 1994; Schramm-Sapyta et al. 2009), and many, but not all, individuals who use drugs in adolescence stop as they develop into early adulthood. The dependent versus casual use trajectories are potentially differentiated by the relative importance of drug use versus other life goals and demands (Flory et al. 2004; Juon et al. 2011; Maume et al. 2005); thus, it is of significant interest to model such factors in laboratory studies. A few parallel examples exist in rat studies. Home cage environmental enrichment has been shown to attenuate the acquisition of cocaine (Puhl et al. 2012) and alcohol self-administration (Deehan et al. 2011) and to diminish the rewarding effects of heroin in rats (El Rawas et al. 2009) and cocaine in mice (Solinas et al. 2008). It has even been shown that sexual rejection by previously mated female flies increases the ethanol intake of male Drosophila melanogaster (Shohat-Ophir et al. 2012).

The opportunity to use an activity wheel is rewarding and reinforcing in laboratory rodents; wheel access will increase the probability of an operant response in rats (Hundt and Premack 1963; Premack et al. 1964) and will maintain lever pressing under a variety of schedules (Belke 2010; Belke and Hancock 2003; Collier and Hirsch 1971; Pierce et al. 1986). Concurrent access to cocaine intravenous self-administration (IVSA) and an activity wheel are mutually suppressing on female rats’ drug intake and activity (Cosgrove et al. 2002), and concurrent access to a wheel suppresses initial d-methamphetamine IVSA in male rats (Miller et al. 2012). In this latter study, once IVSA was established, the introduction of wheel access did not affect drug taking; however, ongoing IVSA of MA gradually decreased the amount of wheel activity. Furthermore, rats introduced to the wheel after 7 or 14 IVSA sessions initially ran very little compared with drug-naive controls. A contrast of rat strains which differed sixfold in spontaneous running showed that the effect of concurrent wheel access on MA IVSA was independent of distance traveled on the wheel. Thus, the effect is most parsimoniously attributed to the rats’ spontaneous preference level, i.e., the reward value of wheel activity.

The present study was conducted to further test the hypothesis that co-option and devaluation of wheel running in rats are a consequence of establishing a consistent pattern of stimulant drug IVSA. The goal was to use conditions under which the reinforcer was highly efficacious and animals undergo an acquisition curve from zero to relatively stable intake across the test interval. The emerging (Benzie et al. 2011; Bluelight 2006; Borek and Holstege 2012; Ross et al. 2012; Wyman et al. 2013) substituted cathinone drug 3,4-methylenedioxypyrovalerone (MDPV) was selected as the stimulant drug model since it is readily self-administered by rats and may be more efficacious than MA (Aarde et al. 2013b; Watterson et al. 2014). MDPV is a potent monamine transporter inhibitor with high dopamine selectivity (Baumann et al. 2013; Cameron et al. 2013; Eshleman et al. 2013; Simmler et al. 2013). The Miller et al. study (2012) used initial food-reinforced lever training which leads to relatively stable drug intake from the start of self-administration (e.g., (Aarde et al. 2013b; Miller et al. 2012) and may have advantaged drug taking over wheel activity. Thus, the present study did not include any lever training before MDPV access.

Methods

Animals

Male Wistar rats (Charles River, NY; N = 32) were housed in a humidity and temperature-controlled (23 ± 1 °C) vivarium on a 12:12-h light/dark cycle. Animals entered the laboratory at 10–11 weeks of age and weighed ∼370 g at the start of the study. All procedures were conducted under protocols approved by the Institutional Care and Use Committees of The Scripps Research Institute and in a manner consistent with the National Institutes of Health Guide for the Care and Use of Laboratory Animals (2011; 8th ed).

Drugs

The racemic 3,4-methylenedioxypyrovalerone HCl used for this study was synthesized as described previously (Aarde et al. 2013b); doses are described as the salt. Drug was dissolved in physiological saline for injection.

Apparatus

Operant conditioning chambers (Med Associates Model ENV-045; Med-PC IV software) enclosed in sound-attenuating cubicles were used for concurrent wheel access and self-administration as previously described (Huang et al. 2012; Miller et al. 2012). Chambers were equipped with slanted activity wheels (∼100-cm inner circumference, 30° tilt of running surface relative to the disk/back), which allowed wheel access while rats were tethered for self-administration.

Intravenous catheterization

Rats were anesthetized with an isoflurane/oxygen vapor mixture (isoflurane 5 % induction, 1–3 % maintenance) and prepared with chronic intravenous catheters as described previously (Aarde et al. 2013a; Aarde et al. 2013b). Briefly, the catheters consisted of a 14-cm length of polyurethane-based tubing (Micro-Renathane®, Braintree Scientific, Inc, Braintree MA, USA) fitted to a guide cannula (Plastics One, Roanoke, VA) curved at an angle and encased in dental cement anchored to an ∼3-cm circle of durable mesh. Catheter tubing was passed subcutaneously from the animal’s back to the right jugular vein. Catheter tubing was inserted into the vein and tied gently with suture thread. A liquid tissue adhesive was used to close the incisions (3M™ Vetbond™ Tissue Adhesive; 1469SB).

A minimum of 4 days was allowed for surgical recovery prior to starting an experiment. For the first 3 days of the recovery period, an antibiotic (cefazolin; 0.4 mg/kg, i.m. day 1, subcutaneous (s.c.) days 2–3) and an analgesic (flunixin; 2.5 mg/kg, s.c.) were administered daily. During recovery, as well as during testing and training, intravenous catheters were flushed with heparinized saline (before sessions) and heparinized saline containing cefazolan (100 mg/mL; after sessions).

Catheter patency was assessed after the last session of the week via administration through the catheter of ∼0.2 mL (10 mg/mL) of the ultra-short-acting barbiturate anesthetic Brevital sodium (1 % methohexital sodium; Eli Lilly, Indianapolis, IN). Animals with patent catheters exhibit prominent signs of anesthesia (pronounced loss of muscle tone) within 3 s after infusion. In this study, a total of N = 26 survived to the end of the described studies with patent catheters.

Intravenous self-administration

All animals were drug and operant-training naive prior to the start of this study. Subjects were transported to an experimental room (ambient temperature 23 ± 1 °C; illuminated by red light) and placed into the operant conditioning chambers for drug self-administration testing 7–8 h after the beginning of the vivarium dark cycle. The catheter fittings on the animals’ backs were connected to polyethylene tubing contained inside a protective spring suspended into the operant conditioning chamber from a liquid swivel attached to a balance arm. Each operant conditioning session started with the extension of two retractable levers into the chamber and lasted for 60 min. Following completion of one lever press (FR1) on the drug-associated lever (same side for all animals), a white stimulus light (above the lever) signaled delivery of the reinforcer (infusion volume = 0.1 mL; duration = 4 s) and remained on during a 20-s post-infusion time-out, during which responses were recorded but had no scheduled consequences. There were no consequences for responses on the alternate lever. After the first two operant conditioning sessions, a single priming infusion of drug was administered non-contingently for any session in which a rat had not responded on the drug-paired lever within the first 30 min.

Data analysis

Measures included infusion rates (infusions/h or infusions/5-min sampling interval), quarter wheel rotations (∼25 cm), lever discrimination (% of presses on the drug-paired lever out of all lever presses, not including time-out), mean post-reinforcement pause (PRP), percent of PRPs equal to the 20-s time-out (PRP% 20 s), and time-out drug-paired lever press rate. Factors included the within-subjects factors of self-administration session (1–20) and acquisition phase (pre vs. post) as well as the between-subjects factors of wheel-access group (unlocked wheel vs. locked wheel) and acquisition-pattern group (binge-like acquisition pattern vs. non-binge acquisition pattern). The binge-like pattern was operationally defined by ≥8 maximum infusions in a given 5-min interval.

Repeated measures analysis of variance (rmANOVA) was used to analyze infusion rate, lever discrimination, and quarter wheel rotations as a function of session as well as to analyze average infusion rate and average quarter wheel rotations as a function of acquisition phase. Post-acquisition measures did not include the “binge” session (addressed in separate analyses; see below). For two rats that acquired IVSA on the first session, pre-acquisition values were derived by mean unit imputation to permit the pre/post-acquisition analyses. Post hoc comparisons for means separation of effects confirmed by rmANOVA utilized Tukey’s honest significant difference method (Tukey’s HSD). Additionally, a Kaplan-Meier survival analysis was used to analyze the rate of acquisition. The acquisition criterion was defined as a sustained intake of ≥6 infusions/h/session, and for the analysis, non-acquiring rats were right censored at 20 operant conditioning sessions and group differences were analyzed with the Mantel-Cox log rank test. Comparisons of time-out lever pressing rate and PRP on the binge session (for non-binge rats, the session for which acquisition criteria was reached was used for comparison) as a function of binge group and wheel group utilized non-parametric Mann-Whitney U tests (Bonferroni corrections; four comparisons per family; corrected for ties; two-tailed). Lastly, the comparison of PRP% 20 s on the binge session used a single t test (unpaired, two-tailed) and was between wheel groups for the binge-like rats only (none of the non-binge rats exhibited a 20-second PRP on their day of acquisition).

Analyses were conducted with GB-STATv7.0 (Dynamic Microsystems, Silver Spring MD) and StatView (SAS Institute, Inc., Cary, NC). Graphs were generated with Microsoft Excel (Microsoft, Redmond WA) and StatView and figures created with Microsoft PowerPoint (Microsoft, Redmond WA), Adobe Illustrator (Adobe Systems Incorporated; San Jose, CA), and Canvas (ACD Systems, Seattle WA).

Results

Wheel group assignment

All animals were initially tested over two sequential days in 1-h periods of access to the activity wheels (untethered). Subsequent self-administration wheel-access groups were confirmed to remain statistically indistinguishable (t(24) = 0.5, p = 0.6) even after the loss of six rats (three from each wheel-access group) due to loss of catheter patency. Mean pre-self-administration wheel activity for the animals that remained patent was 915 quarter rotations/h (SD = 350; SEM = 98) for the unlocked-wheel group and 840 quarter rotations (SD = 350; SEM = 97) for the locked-wheel group.

Unlocked vs. locked wheel

3,4-Methylenedioxypyrovalerone infusions

The mean number of infusions of MDPV (0.05 mg/kg/infusion; Fig. 1a) obtained increased across the 20-session acquisition interval (main effect of session: F 19,456 = 5.9; p < 0.0001). However, the groups did not differ in infusions; thus, there was no effect of wheel access on infusions obtained. Post hoc comparisons, collapsed across group, confirmed that infusions were higher in operant conditioning sessions 5–6 and 11–20 relative to 1; in sessions 5, 11, and 15–20 relative to 2; in sessions 5 and 15–20 relative to 3; and in sessions 17–20 relative to 4. No significant differences in infusions were confirmed for sessions 6–20. Lastly, there was no effect of wheel condition on the rate of acquisition (sessions to acquire for both groups: M = 6, SD = 5, SEM = 2) as shown in Fig. 2.

Fig. 1
figure 1

Intravenous self-administration (IVSA) acquisition measures as a function of wheel group (unlocked vs. locked) and IVSA session. Group (N = 13 per group) means of a infusion of 3,4-methylenedioxypyrovalerone (0.05 mg/kg/infusion), b lever discrimination, and c quarter wheel rotations (unlocked group only) are presented. The numbers above session means indicate significant differences between that session and the session indicated by the number. Error bars represent ± SEM

Fig. 2
figure 2

Acquisition criterion. a Individual infusions are depicted by wheel group (locked vs. unlocked). The acquisition criterion was defined as a sustained intake of ≥6 infusions/h/session, indicated by the horizontal dotted line. b The proportion of the unlocked-wheel and locked-wheel groups that had met acquisition criteria on a given session

Lever discrimination

The proportion of lever presses on the drug-paired lever increased significantly across operant conditioning sessions (F 19,456 = 5.9; p < 0.0001; Fig. 1b). The ANOVA did not confirm any significant effects of wheel-access group or interaction between group and session. Post hoc comparisons, collapsed across group, confirmed that lever discrimination was higher in sessions 10, 13–14, and 17–20 relative to 1; in sessions 10–11 and 13–20 relative to 2 and 5; in sessions 13 and 19–20 relative to 3; and in sessions 13–14 and 17–20 relative to 4. No significant differences in lever discrimination were confirmed for sessions 7–20.

Wheel activity

Mean quarter wheel rotations in the unlocked-wheel group were initially 730 (SD = 336; SEM = 93) and significantly decreased across the 20-session acquisition period to below 300 quarter rotations in the final two operant conditioning sessions (main effect of session, F 19,228 = 2.7; p < 0.001; Fig. 1c). Post hoc comparisons confirmed that quarter rotations were lower in sessions 19–20 relative to session 6.

Individual wheel activity and drug intake

Across operant conditioning sessions

The individual patterns of wheel activity and drug lever responding as a function of acquisition session were contrasted (Fig. 3) in an effort to provide further insight into MDPV IVSA acquisition that would be obscured by the group-averaging approach. Indeed, self-administration for 6 of 13 individuals in the locked-wheel group (rat nos. 514, 512, 510, 501, 509, and 507) and for 7 of 13 in the unlocked-wheel group (rat nos. 572, 513, 515, 574, 506, 586, and—though not clear at this level of analysis, see below and Fig. 4—rat no. 511) exhibited a binge-like acquisition pattern. For these individuals, the maximum intake was observed early in each individual’s interval of stable intake, and this initial high-intake binge was typically followed by stable intake at a lower or gradually increasing level thereafter. At this level of analysis, the distributions of MDPV intake patterns of the unlocked-wheel and locked-wheel groups were qualitatively similar.

Fig. 3
figure 3

Individual traces of infusions obtained and wheel rotations as a function of IVSA session. a Individual MDPV infusions earned (filled circles) are depicted for rats in the locked-wheel group. b Individual MDPV infusions earned (filled circles) and wheel activity emitted (quarter wheel rotations, QWR; unfilled circles) across the 20-session acquisition period are depicted the unlocked-wheel group. For both groups, the individual subject number and the acquisition-pattern grouping (B binge-like, N non-binge) are indicated

Fig. 4
figure 4

Within-session infusions obtained and wheel rotations as a function of 5-min sampling interval and session (specifically the sessions before, of, and after acquisition): MDPV infusions and quarter rotations (QWR) of the wheel per 5-min sampling intervals spanning the sessions around the time of acquisition (−1 = before, 0 = of, and 1 = after; except for rat 505 as it acquired on its first session) for rats in the unlocked-wheel group (UW) that met the acquisition criteria. The acquisition-pattern group to which animals were assigned is indicated in the upper right corner (B binge-like, N non-binge-like). Six of the 7 binge rats (572, 515, 574, 506, 513, and 511; top six panels) exhibited sustained, relatively precipitous drops in wheel activity around the time within the session when the infusion rate rose sharply. Wheel activity did not, in general, decrease across sessions prior to the spike in intake for binge rats nor for any session in non-binge rats

Eight of the 13 unlocked-wheel individuals exhibited behavior patterns in which wheel activity dropped notably across the acquisition interval (Fig. 3). For six of these animals, this was precipitous and occurred simultaneously with a significant increase in drug intake (i.e., notable crossovers in the traces for infusions and wheel activity for rat nos. 572, 513, 515, 574, 506, and 511) while the other two rats (i.e., rat nos. 505 and 504) exhibited a comparatively more gradual reduction in wheel activity in the midst of a stable pattern of drug intake. Of the remaining five rats, the three that did not make acquisition criterion (i.e., rat nos. 502, 573, and 580) showed stable or increasing wheel activity across most of the acquisition interval while the last two rats (i.e., rat nos. 503 and 586) showed a multiphasic pattern of wheel activity.

Within operant conditioning sessions, surrounding the binge

Data that are averaged across individuals (Fig. 1) and even across operant conditioning sessions within-individual (Fig. 3) obscure the close temporal relationship that was observed between reductions in wheel activity and the acquisition of MDPV self-administration and also underestimate the maximum infusion rate. As can be seen in the plots of within-session changes in infusion rate and wheel activity as a function of 5-min sampling interval (Fig. 4), the appearance of a crossover from wheel activity to drug self-administration occurred in the timescale of minutes within the binge session—rather than gradually over a session or between sessions—for six of the seven rats that exhibited a binge-like acquisition pattern. The within-session analysis also illustrates that when the binge-like pattern (defined by ≥8 maximum infusions in a given 5-min interval) occurs late in the session (i.e., rat no. 511), the overall session intake may not reflect the binge phenotype. Finally, the sustained reduction in wheel activity was not observed in rats for which no binge-like acquisition pattern was observed.

Binge-like vs. non-binge acquisition patterns

3,4-Methylenedioxypyrovalerone infusions and wheel rotations—pre- vs. post-acquisition

Independent of wheel condition (unlocked wheel, locked wheel), the animals that exhibited a binge-like pattern of MDPV self-administration acquisition (defined by ≥8 maximum infusions in a given 5-min interval) had higher mean post-acquisition infusions per hour (M = 21, SD = 6, SEM = 2) than rats that did not show this pattern (M = 15, SD = 3, SEM = 1) (group by pre/post-acquisition interaction; F 1,20 = 9.3; p < 0.01)—even after excluding the spike in intake of the identified binge session (Fig. 5a, left). Pre-acquisition infusion rates were not reliably different between binge-like (M = 1, SD = 1, SEM = 0.3) and non-binge (M = 2, SD = 1, SEM = 0.3) (p = 0.1) subgroups, and the post-acquisition infusion rates were higher than pre-acquisition rates for both binge-like and non-binge (both p < 0.0001).

Fig. 5
figure 5

Behavior as a function of acquisition pattern, a Mean MDPV infusions (left) and mean quarter wheel rotations (right) as a function of acquisition-pattern group (Binge? = “YES” or “NO”) and acquisition phase (PRE vs. POST). Symbols for means separation (all p < 0.05): *YES ≠ NO; #PRE ≠ POST. Error bars represent SEM. b For the binge session (for “NO”-binge rats, the first session wherein acquisition criterion was met was used for comparison), the measures of total time-out drug-paired lever presses (left), mean post-reinforcement pause (PRP; middle), and % of PRP equal to the 20-s imposed time-out (%PRP = 20 s; right) are plotted as a function of wheel group (locked wheel, LW; unlocked wheel, UW) and acquisition-pattern group (except for %PRP = 20 s for which only binge rats are shown; %PRP = 20 s was 0 % for all “NO”-binge rats). Symbols for means separation (all p < 0.05): *YES ≠ NO; $LW ≠ UW. Error bars represent SEM

Within the unlocked-wheel group, post-acquisition quarter rotations of the binge-like rats (M = 230, SD = 275, SEM = 104) were less than one fourth of those observed pre-acquisition (M = 974, SD = 339, SEM = 128), while wheel activity was similar for the non-binge rats post-acquisition (M = 904, SD = 610, SEM = 352) and pre-acquisition (M = 855, SD = 474, SEM = 274; Fig. 5a, right). This interaction was confirmed by rmANOVA (acquisition-pattern group by acquisition phase interaction; F 1,8 = 5.3; p < 0.05) and post hoc comparisons between groups confirmed that wheel activity was lower for binge-like than non-binge rats post-acquisition (p < 0.05) but was not different for pre-acquisition sessions. Lastly, post hoc comparisons between pre- and post-acquisition sessions confirmed that for binge-like rats, wheel quarter rotations were lower post-acquisition than pre-acquisition (p < 0.01), but for non-binge rats, the post- and pre-acquisition wheel activity was equivalent.

Time-out lever presses and post-reinforcement pause on the binge session

Examination of the behavioral events recorded in real time during the drug intake binge (again, this was defined by ≥8 maximum infusions in a given 5-min interval) showed intervals wherein infusion rate was constrained only by the 20-s post-reinforcement time-out (during which drug-associated lever presses did not have scheduled consequences). Therefore, drug-associated lever presses during the imposed time-out, the mean duration of post-reinforcement pauses (PRPs), and the percentage of PRPs equal to 20 s (%PRP = 20 s) were compared as a function of acquisition-pattern group and wheel group (Fig. 5b), to further analyze and characterize behavioral responses.

For locked-wheel rats, time-out lever pressing was higher for binge-like rats (M = 1574, SD = 921, SEM = 376) than non-binge rats (M = 201, SD = 123, SEM = 50) (U BAP (6) = 36, U NAP (6) = 0, Z = −2.9, p < 0.0125). However, for unlocked-wheel rats, although time-out lever pressing for binge-like rats (M = 780, SD = 470, SEM = 178) was higher than that of non-binge rats (M = 162, SD = 31, SEM = 18), this difference did not reach the criterion for statistical reliability. Time-out lever pressing did not differ between locked-wheel and unlocked-wheel rats for either binge-like or non-binge subgroups.

Within the locked-wheel rat group, mean PRP duration was shorter for binge-like rats (M = 26, SD = 5, SEM = 2) than non-binge rats (M = 98, SD = 64, SEM = 26) (U BAP (6) = 0, U NAP (6) = 36, Z = −2.9, p < 0.0125). Although the mean PRP for binge-like rats within the unlocked-wheel group (M = 34, SD = 5, SEM = 2) was shorter than that of the unlocked-wheel non-binge rats (M = 261, SD = 193, SEM = 111), this difference was not statistically reliable. The PRP was reliably shorter for locked-wheel than unlocked-wheel binge-like rats (U LW (6) = 3, U UW (7) = 39, Z = −2.6, p < 0.0125), but across all of the non-binge rats, the locked-wheel and unlocked-wheel groupings were equivalent.

Lastly, for the binge-like rats the percentage of PRPs equal to the minimum of 20 s was greater for the locked-wheel rats (M = 38, SD = 19, SEM = 8) than the unlocked-wheel rats (M = 16, SD = 8, SEM = 3) (t (11) = 2.8, p < 0.05). None of the non-binge rats had a PRP equal to 20 s on their session of acquisition.

Discussion

These data show that the initial acquisition and stabilization of intravenous self-administration (IVSA) of 3,4-methylenedioxypyrovalerone (MDPV) in rats reduce their activity on a wheel when the options are presented concurrently. Thus, the opportunity for drug self-administration replaces or diminishes the reward value of wheel activity, despite the fact that sufficient time exists within an operant conditioning session to engage in both behaviors at the observed rates. This appeared to be an abrupt phenomenon, occurring within a single session particularly when a binge was observed. Although the within-session effect on the binge day might reflect the induction of stereotypies or other competing behavior that precludes wheel running, this cannot account for the continued pattern of low wheel activity in subsequent sessions. This confirms and extends our prior study with methamphetamine (MA) self-administration (Miller et al. 2012). Mean wheel activity in the initial few sessions was nearly identical to that previously reported for the Wistar group in Miller et al. (2012), and the mean number of infusions obtained for the last five sessions was 16 infusions (SEM = 1), similar to the mean of 19 infusions obtained by lever-trained Wistar rats responding under an FR5 contingency (Aarde et al. 2013b), which enhances comparison across studies. Similarly, the mean lever discrimination ratio was 80 % (SEM = 3) in the last five sessions which was similar to that reported previously for an FR5 schedule of reinforcement (86 %) in a group previously lever trained (Aarde et al. 2013b). These studies therefore concur with the prior reports showing that MDPV is a highly effective reinforcer in rat self-administration models (Aarde et al. 2013b; Watterson et al. 2014).

Gradual acquisition of MDPV IVSA within the group (Fig. 1a) was related to a progressive reduction in the amount of wheel activity (Fig. 1c), in a pattern similar to the effects of MA self-administration on rats in our prior study (Miller et al. 2012). In that study, however, the use of lever pre-training resulted in substantial group MA intake from the very first session so it was not possible to observe any transition in drug intake and wheel activity within individuals. In this design, it was possible to observe such transition points and six of the 13 animals in the wheel-access group exhibited single-session transitions (Fig. 3) in which relatively low drug intake and high wheel activity were replaced by increased drug IVSA and decreased wheel use. An additional three animals exhibited patterns in which wheel activity declined only after an interval of several operant conditioning sessions which consisted of stable and relatively high drug intake. Another three individuals maintained sustained wheel activity and failed to acquire MDPV IVSA, and one individual sustained both robust wheel activity and relatively high MDPV intake. Thus, the plurality of individual trajectories supports the conclusion that as MDPV IVSA was acquired within an individual, the wheel was devalued in most individuals. Furthermore, the emergence of a binge-like, single-session acquisition pattern in about half of the wheel-access rats was related to the greatest reduction in wheel activity (Fig. 5a). Although qualitatively, it appeared that the binge may have been delayed in the group with unlocked wheels (Fig. 3), the present design does not support an authoritative conclusion; additional investigation with larger groups might show whether wheel access delays the occurrence binge session.

The fact that MDPV IVSA was not reduced by concurrent wheel access, as has been shown for cocaine (Cosgrove et al. 2002) and MA (Miller et al. 2012), may be due to its relatively high efficacy as a reinforcer, i.e., compared with methamphetamine (Aarde et al. 2013b). Similarly, environmental enrichment protects against the escalation of cocaine intake under long-access conditions only when the per-infusion dose is relatively low (Gipson et al. 2011). These results are consistent with growing evidence that indicates that MDPV is a more efficacious and/or potent reinforcer than is MA in rat self-administration (Aarde et al. 2013b; Watterson et al. 2014). This relative advantage may explain why concurrent wheel activity caused a mean difference in initial MA intake (Miller et al. 2012) but not MDPV intake in the present study.

Individual differences in the effect of concurrent wheel access are critical, since only a subset of humans exposed to a wide variety of psychotropic drugs will become dependent (Anthony et al. 1994; Schramm-Sapyta et al. 2009). As Ahmed and colleagues have argued (Ahmed et al. 2013), animal models of the transition point beyond which drug taking supplants other sources of reinforcement may be advantageous to further our understanding of the individual differences in resilience against compulsive drug use in humans. Rats prefer saccharin or sucrose flavor to cocaine in a direct choice (Lenoir et al. 2007), and only with extended access to heroin will rats reverse their preference for saccharin and choose each reinforcer about equivalently (Lenoir et al. 2013). Given the ecological value of cues for high-value food, these paradigms may not be the most sensitive. While wheel activity is highly reinforcing in laboratory rats, the present data suggest that it may present the ideal contrast with drug IVSA for such investigations.

The binge-like, single-session acquisition that was observed in half of the rats across the entire study differs from traditional concepts of drug self-administration binges. Roberts and colleagues (Morgan et al. 2005; Roberts et al. 2002) have defined binges as the opportunity for extended drug access across many hours to an entire day and Miczek and colleagues (Fowler et al. 2007; Tornatzky and Miczek 2000) similarly define a binge as high intake rates during a relatively long period of extended access (e.g., 24 h). However, in both cases, the binge occurs after acquisition and usually after a manipulation intended to increase intake levels (e.g., extended/long daily access sessions or social defeat stress). Indeed, the 6-h escalation paradigm pioneered by Ahmed and Koob (Ahmed and Koob 1998, 1999) has been shown to result in gradual mean increases in drug intake during the first hour of sessions that is sometimes characterized as an induced “loading-phase” binge. In contrast, the binge-like behavior in this study was observed spontaneously and was associated with the first days of acquisition. Thus, although the binge observed in these studies was characterized by relatively unrestrained, high intake levels—much as would occur for binge drinking or eating—the position of this binge near the beginning of acquisition and without prior intake-augmenting manipulations sets it apart from that traditionally described in drug self-administration studies.

Further experimentation will be necessary to determine if the binge experience causes the subsequent higher intake or simply serves as a marker for subsequent intake levels. Evidence that favors the former is that, in rats, the increased consumption levels of sweetened shortening that are produced by prior experience with a binge-inducing schedule (i.e., limited, intermittent access) are not observed if this binge experience is prevented by limiting the amount of shortening available (i.e., experiencing the binge-inducing schedule is not sufficient to increase later consumption—the binge experience must also occur) (Wojnicki et al. 2008). If these binge-eating results generalize to drug binging, then it is likely that similar binge-limiting interventions during the theorized initial binge/intoxication phase of addiction could prevent or retard the progression of addiction. This possibility increases the promise of immunological protection (e.g., antibodies with drug affinity (Miller et al. 2013) and gene therapies (e.g., transfection with genes for drug-metabolizing enzymes (Anker et al. 2012) to decrease the incidence of addiction among at-risk individuals by interfering with initial, binge-like drug experiences.

It is also unknown if this binge acquisition pattern is unique to MDPV or if it is a specific consequence of the high reward/reinforcer properties of MDPV. There are hints of a similar pattern in cocaine self-administration (Carroll and Lac 1993), but few reports that directly address this phenomenon are available. If the differing reward/reinforcer properties of different stimulants determine the frequency and magnitude of a binge-like acquisition, then it is noteworthy that the reinforcer efficacy of a drug appears to be positively correlated with the relative actions of that drug upon dopamine and serotonin levels in the brain (Bauer et al. 2013) and, furthermore, that MDPV tops this list of stimulants with regard to its half-maximal dopamine transporter (DAT)/serotonin transporter (SERT) inhibitory concentration (IC50) ratio with a ratio of ∼300 (compared to cocaine at a ratio of ∼3) (Simmler et al. 2013); also see (Baumann et al. 2013; Cameron et al. 2013). As responding for cocaine under a progressive-ratio schedule (a well-validated measure of reinforcer efficacy) can be increased by serotonergic lesions of the medial forebrain bundle and amygdala (Loh and Roberts 1990), it is likely that MDPV’s high selectivity for enhancing dopamine levels over serotonin levels sets this drug apart from other stimulants as a reward/reinforcer. Indeed, recent evidence indicates that MDMA, a drug that is relatively weakly reinforcing (DAT/SERT IC50 ratio = 0.08; (Simmler et al. 2013)), is self-administered more readily after similar serotonergic lesions and in SERT knockout rats as compared to controls (Bradbury et al. 2013; Oakly et al. 2013). In other words, current data suggest that, as a consequence of its neurochemical specificities, MDPV may function as a “super reward” that elicits binge-like intakes at the onset of use (Aarde et al. 2013b; Simmler et al. 2013).

In summary, this study confirms the high reinforcing value and therefore likely high abuse liability, of MDPV. In contrast to our prior findings with methamphetamine, concurrent wheel access did not significantly affect mean MDPV intake. This shows that there may be specific parameters of drug identity, behavioral training history, etc. that may affect the shifts between drug self-administration and wheel activity. In the present study, the acquisition of MDPV self-administration supplanted wheel activity as a reinforcer in rats, which is similar to the group mean effects shown for MA in Miller et al. (2012). Examination of the individual patterns of wheel activity and MDPV infusions showed that this is essentially a one-trial effect in many individuals. Once a rat has experienced a relatively high-dose self-administration session, the wheel is subsequently used less than that individuals’ baseline, thereby confirming that MDPV use can rapidly attenuate the appeal of a natural source of reward. Access to a wheel is otherwise highly reinforcing in laboratory rats (Belke and Heyman 1994; Hundt and Premack 1963), and therefore, the present results show the rapid devaluation of this naturalistic reinforcer by the acquisition of stimulant drug IVSA. As such, this paradigm models transition states in rats on both a group and individual basis.