Keyword

1 Introduction

Anorexia nervosa (AN) is a severe psychiatric disorder associated with food avoidance, severe emaciation, and the highest mortality rate among the psychiatric disorders (Sullivan 1995; APA 2000). Clinical evidence suggests that individuals with AN have disturbances in the processing of naturally rewarding stimuli, such as food. The processing of food reward is complex and modulated by cognitive, emotional, and biologic factors that involve learned behaviors and genetic predisposition. In the brain, cortical and subcortical networks integrate external food stimuli with the internal drive and motivation to eat, resulting in behavior activation and approach or avoidance of food. Brain regions implicated in this process include the insula, the basal ganglia (anteroventral and dorsal striatum, nucleus accumbens), the ventral tegmental area (VTA), the anterior cingulate, orbitofrontal, and mesial temporal cortex, as well as the hypothalamus. This network, called the reward system, processes food and other stimuli in order to “achieve an optimal stimulus response” (Cooper et al. 2003). The function of the reward system in patients with AN has been inadequately studied. Clinically, AN subjects are able to refrain from eating over prolonged periods of time and do not respond to mechanisms that would, in most people, stimulate the drive to eat and approach food. Recent studies in AN subjects suggest abnormal brain activity in striatal parts of the basal ganglia that are related to dopamine receptor function (Frank et al. 2005; Wagner et al. 2005). These brain regions have been associated with the processing of immediate and delayed rewards in control subjects, how much reward someone may expect from a stimulus in the future, as well as the amount of motivation a person has to approach a possible reward. In this chapter we describe the use of a model founded on neuroscience that is primarily based on brain dopamine neurotransmission and brain imaging, and we discuss how such an approach can be used to identify pathways directly linked to underlying neurobiologic abnormalities in AN. Specifically, computer models based on basic science research can identify specific neuronal brain function, and results can be used to validate fMRI results and identify disorder-specific abnormalities.

2 Possible Mechanisms Underlying Reward System Abnormalities in AN

The motivation to eat and approach food is an important part of the reward pathway and is particularly disturbed in AN. AN women like sweet stimuli and do not deny having an appetite (Schweiger and Fichter 1997). Clinically, AN individuals frequently express some desire to eat and gain weight but usually continue to be drawn to the goal of weight loss and food restriction. Moreover, acting on the desire to eat is even more difficult, and thus AN individuals seem unable to direct their behavior and essential motor function toward necessary goals of weight gain. While the AN patient still has the ability to eat, move, experience taste, and even likes sweet stimuli more than controls (Drewnowski et al. 1987), she cannot activate herself sufficiently to eat adequately. This pattern is reminiscent of dopamine-deficient mice (Cannon and Palmiter 2003). These rodents like sweet sucrose or noncaloric artificial sweetener solution more than water, but show more infrequent initiation of drinking compared to controls. They are hypophagic (Zhou and Palmiter 1995) and will die of starvation even with readily available food in their cage, without motor impairment being the cause for the hypophagia. It was hypothesized that these mice have a reduced capacity to respond to rewarding stimuli [“lower basal and maximum response capacity” (Cannon and Bseikri 2004)]. Thus, dopamine-deficient mice may have a reduced motivation or capacity to obtain rewards and may not be able to sufficiently direct their behaviors toward the appropriate goals (Cannon and Bseikri 2004), such as food intake and maintenance of a normal body weight. AN, in fact, has been associated with reduced brain dopamine and abnormal dopamine receptor function in the anteroventral striatum (Kaye et al. 1999; Frank et al. 2005). Many cognitive prefrontal processes, such as fear of weight gain, cannot be addressed in the dopamine-deficient mouse model. However, low anteroventral striatal dopamine-related activity may be associated with less-afferent prefrontal input (Montague et al. 2004) and decreased motivation to approach food rewards. While speculative, it is possible that AN individuals possess a reduced drive to approach food stimuli in conjunction with overvalued ideas and desires for thinness and fear of weight gain. Together, such a constellation may lead to an imbalance of cortical and subcortical drives resulting in reduced food appetence and subsequent weight loss. Another function of the dopamine system is alerting the individual for novel stimuli and initiating learning from those experiences (Schultz 1998, 2002). It appears that the initial phase of food reduction and weight loss is associated with a sense of excitement (Bergh and Sodersten 1996); however, long-term or chronic anorexic subjects do not experience such happiness or excitement but are merely trying to avoid or ameliorate perceived aversive states, such as weight gain or feelings of being fat (APA 2000). This behavior seems to have aspects in common with substance use withdrawal. Koob and Le Moal (Koob and Le Moal 2005) describe a compulsive state that follows the intoxication of a drug addict characterized by withdrawal and negative affect, emotional pain, alexithymia, and dysphoria. Instead of the positive rewards experienced during the initial phase of drug use, plastic changes in the reward pathways create an antireward system that “drives aversive states” (Koob and Le Moal 2005). This negative antireward system has been associated with altered dopaminergic states. AN and drug users may lie on the opposite ends of the spectrum with increased dopamine receptor binding in AN but reduced dopamine receptors in the substance users (Volkow et al. 2004). AN subjects are chronically dysphoric, have difficulties expressing their emotions, and cannot enjoy things as much as others (Anderluh et al. 2003), and we believe that AN subjects might be stuck in an antireward system with reduced dopamine function, whereas such a state is only temporary in drug users before they self-medicate with another hit or drink. Interestingly, alcoholism (Kampov-Polevoy et al. 2003) and benzodiazepine use (Yasoshima and Yamamoto 2005) have also been associated with altered sweet taste response. In summary, there may be a combination of dopamine abnormalities, either trait or state related, and psychologic factors that control eating behavior and influence the motivational system.

3 Taste and Reward System Activation Research in AN

Taste by itself is a reward system activator, at least in part through dopamine activation in the brain. Taste receptors on the tongue project to the brain stem nucleus of the solitary tract (NST) and from there to the parabrachial nucleus (PBN). Then the taste pathway bifurcates to a thalamic/cortical tract and the hypothalamic/limbic pathway with projections to the amygdala and nucleus accumbens (Norgren et al. 2006; Hajnal et al. 2009). Thus, taste directly stimulates the dopamine pathway. Taste paradigms in AN can be applied as long as disorder-specific mechanisms are taken into account. Sweet taste perception is overall preserved in AN, but the hedonic experience of sweet taste is particularly biased by the fear of weight gain. Several studies have investigated taste perception in AN in the past. Some found in AN subjects reduced (Toth et al. 2004) or altered (Drewnowski et al. 1987) taste sensitivity while ill and after refeeding. Others found no disturbance in the ability to rate sweet perception (Di Costanzo et al. 1998) but found altered hedonic ratings compared to controls, where AN preferred sweeter stimuli (Sunday and Halmi 1990). Another group did not find such abnormalities (Simon et al. 1993). Exercising is a possible confound of hedonic taste perception studies as well, since more than 3 h of activity per week decreased preference ratings for high-sucrose and high-fat stimuli in controls (Crystal et al. 1995). This is important since women with AN frequently exercise. Hedonic taste response is heavily biased by the perception of calorie intake and the possibility of weight gain as a deterrent to enjoy food. In fact, one study found that sweet taste pleasantness ratings were preserved in a paradigm where AN did not swallow but spit out the sweet (sugar) taste stimulus (Eiber et al. 2002). In the same paradigm, hedonic ratings were reduced when subjects had to swallow the sugar stimulus, suggesting strong cognitive bias toward the high-caloric fluid. Taken together, there is limited evidence for taste perception alterations per se in AN, although the studies performed are few with small sample sizes. Potentially more important, the hedonic rating results tend to be biased by an overriding fear of weight gain.

In summary, taste response can be used in AN to test reward system abnormalities. However, the cognitive and emotional activation specific to AN has to be taken into account in the model.

4 Reward Pathway Function and Neurobiology

The ideal function of the brain reward system is to ensure that we make the best choices in response to environmental stimuli (Cooper et al. 2003) in order to support our long- and short-term goals. Natural rewarding food-related stimuli have been studied extensively (Saper et al. 2002). A large number of studies have also examined recreational drugs, alcohol, monetary reward, and other potentially rewarding stimuli (Martin-Soelch et al. 2001; Grigson 2002). The mechanisms of reward processing in the brain, though, are shared across stimuli (Grigson 2002). The main psychological components of this system are motivation, learning, and emotion or affect (Berridge and Robinson 2003). One line of research distinguished conscious “liking,” relating to an emotional response to stimuli, from “wanting” which relates to intrinsic motivation reflective of how much an individual is willing to work for a reward. For the concept of wanting, the term “incentive salience” of stimuli has been proposed (Berridge 1996; Berridge and Robinson 1998). An underlying principle of reward processing seems to be “the anticipation of outcomes of behavior in situations with varying degrees of uncertainty” (Schultz 2006). That means we frequently have to make choices in an environment with various types of stimuli with respect to which of those stimuli might fit best our desires and needs. A stimulus can have several rewarding effects for an organism (Wise 2002). The brain tries to assess and anticipate the outcome obtained from a stimulus, weighing between available stimuli in order to determine which should be approached based on the highest reward value. While there are genetically determined contingencies of reward, for instance the innate liking of sweet taste (Bartoshuk and Beauchamp 1994), learning, and conditioning, reward-predicting experiences are important for the development of personal preference and reward system activation (Schultz 2006). From our exposure to stimuli and positive or negative reward experience, we learn what to approach and what to avoid. While we develop representations in the brain of what we like and what we do not, new experiences update these representations continuously based on their reward value.

Various neurotransmitters and regions in the brain have been found to process rewarding stimuli. Corticostriatal–hypothalamic pathways have been implicated in the receipt of sensory stimuli and subsequent processing of those stimuli’s reward value, followed by behavioral activation in order to approach or avoid stimuli (Kelley et al. 2005). Dopamine pathways, mainly in the VTA, striatum, and nucleus accumbens, extending to the orbitofrontal cortex, may be involved in the approach and motivational aspects of related behavior (Kelley et al. 2005; Schultz 2006). Thus, dopamine activation may be associated with the “wanting” concept described by Berridge (Berridge and Robinson 2003). Opioid neurotransmission is localized closely to dopaminergic neurons in the ventral and dorsal striatum. Opioid activation in these areas, regulated by cholinergic interneurons, may mediate the pleasure that is associated with stimuli, and GABA neurotransmission into the anteroventral striatum may prevent excessive response to rewarding stimuli (Kelley et al. 2005). In addition, there appears to be a fine balance of opioid receptor-driven activation of dopamine and GABA neuronal activation of the reward system via glutamatergic neurons (Mansvelder 2005; O’Reilly et al. 2007).

The prefrontal cortex exerts voluntary control over behavior (Price 2005) and is an important input for the reward system (Wise 2005) in terms of deliberate control and bias toward behavior. It appears that afferent phasic dopamine input to the prefrontal cortex may be needed in order to modulate goals based on rewarding experiences (Montague et al. 2004). These dopamine pathways are influenced by various factors. Food restriction may increase dopamine receptor binding in the nucleus accumbens and caudate/putamen (Carr et al. 2003), and dopamine D2 receptors may regulate dopamine release (Cooper et al. 2003). Alterations in this mechanism may be particularly involved in substance use but also food-related disorders (Blum et al. 1995). In addition, nonpredictability is an important determinant for the activation of the anteroventral striatum dopamine system since unexpected stimuli elicit a greater response in functional dopamine transmission (Berns et al. 2001). Furthermore, serotonin pathways have been hypothesized to alter the mediation of brain reward response (Higgins and Fletcher 2003) with, for instance, serotonin 2C receptor activation attenuating drug-seeking behavior.

5 Reward Brain Imaging Studies and Anorexia Nervosa

Elements of the brain reward system have been identified through anatomic and electrophysiologic studies in nonhuman primates and neuroimaging studies in humans. Single-cell firing rates within the basal ganglia, VTA, nucleus accumbens – a part of the ventral striatum, amygdala, orbital–frontal and prefrontal cortex – are modulated by rewarding stimuli (Apicella et al. 1991; Hikosaka and Watanabe 2000; Schultz et al. 2000; Wise 2002). Neuroimaging studies using a variety of reward stimuli have corroborated the electrophysiology data. In particular, studies using monetary rewards have revealed patterns of reward-related activity in the amygdala, orbital–frontal and mesial prefrontal cortex, cingulate cortex, and portions of the striatum to which these limbic regions project (Delgado et al. 2000, 2003; Knutson et al. 2000; Breiter et al. 2001; May et al. 2004). The ventral striatum is involved in reward processing per se, while the dorsal striatum is involved in linking action to outcome (O’Doherty et al. 2004; Tricomi et al. 2004). Like the ventral striatum, more dorsal sectors of the striatum are also modulated by dopamine. However, these regions are also implicated in motor and cognitive control, specifically the learning of stimulus response associations. The caudate nucleus is activated by those tasks in which there exist both a perceived connection between action and outcome and some uncertainty about whether the action will lead to the desired outcome (Tricomi et al. 2004). O’Doherty and colleagues (O’Doherty et al. 2004) postulated that projections to the ventral striatum might be involved in reward prediction (“critic”) and instrumental learning, whereas projections to dorsal striatum (“actor”) might be involved in the modulation of stimulus response in order to choose the best behavioral response based on previous experience. Most recently, immediately rewarding processes have been related to ventral striatal activation, whereas delayed gratification may activate dorsal striatal brain response (Tanaka et al. 2004). This is important since, phenotypically, AN have a very high degree of self-control and are able to delay gratification in various aspects of life. Aside from delaying eating, they are extremely goal oriented and perfectionistic, they tend to work harder than peers, and they seem to prefer delaying reward over not meeting their immediate goals and standards. Abnormalities in the anteroventral and dorsal striatal reward system in AN could contribute to food restraint and the ability to refrain from commonly desired pleasurable experiences for long periods of time. Taken together, brain imaging in human controls show that testing reward pathway activation in vivo is possible, and that results obtained are consistent with the animal literature and single neuron-recording studies. It is debated whether brain imaging such as fMRI can identify pathways directly related to neurotransmitter function. Recent data provide evidence suggesting that this is in fact the case (see limitations).

Very few studies exist that specifically assessed reward system brain activation in AN. One study suggested dopamine abnormalities in AN after recovery (Frank et al. 2005), while functional magnetic resonance imaging studies in recovered AN also showed abnormalities in response to a monetary gain task (Wagner et al. 2007). Ill AN were recently shown to have increased ventral striatal response to pictures of underweight bodies. Taken together, the few studies available suggest reward system abnormalities in AN, but the neurobiologic underpinnings remain largely unknown.

6 Neurocomputational Modeling and Brain Imaging

The integration of animal models with human in vivo studies remains a crucial step in translational research. Computer models should be able to predict human behavior (based on the basic research findings) using specific tasks that apply the animal model in the human. That is, the computer model is used to test whether the brain imaging results gathered during the task application indeed reflect the brain activation obtained in the previously validated basic research. If this is the case, then one can make actual inferences on human neurobiologic brain processes related to disease.

6.1 Principals of the Temporal Difference Model

Behavioral models have been developed that mathematically describe aspects of reward under conditions that can be related to brain imaging results. This sets the stage for “Hypothesis-driven research” that “has been shown to be an excellent model for pursuing investigations in neuroscience” (Koslow 2005). Such neurocomputational models link psychological theories with underlying neural substrates. In this chapter, we describe the application of the temporal difference (TD) model (Schultz 2006) and will provide preliminary data.

Various models describe concepts of reinforcement learning (Sutton and Barto 1981), which both concerns the anticipatory control of actions and prediction of potential reward values (Worgotter and Porr 2005). The TD model has been intensively studied and applied to the neural reward system and is a valid model for studying reinforcement learning and dopamine neuronal pathway activity in the basal ganglia. It has been studied in vivo in relation to dopamine system activation in rodents using single-neuron recordings (Schultz 1998, 2002; Schultz et al. 2000) as well as in humans using brain imaging techniques (O’Doherty et al. 2003, 2004; Tanaka et al. 2004). This model (Sutton and Barto 1981; Worgotter and Porr 2005) integrates elements of the individual “value” of a reward and the prediction of its rewarding property in the future. Since rewards do not always occur and are not entirely predictable, a temporal difference error is introduced in the model that functions as a “learning signal.” This error is dependent on whether the predicted reward is consistent with the received reward or if there is a discrepancy between prediction and reality (error). Another element in the model is the discount factor, which takes into account how many times a reward has been received and how many times the subject has had the opportunity to learn from experience to derive predictions.

6.2 Temporal Difference Model Study Design

The study design that we use was originally described by O’Doherty et al. (2003). Each trial consists of the presentation of one of three arbitrary visual stimuli (see Fig. 1) followed 2 s later by 1 ml of a pleasant sweet taste (1 M sucrose) (CS+), a neutral tasteless solution [CSneut: 25 mM KCl, 2 mM NaHCO3, (Francis et al. 1999; Frank et al. 2003)], or no taste (CS−). The visual stimuli are presented on a gray background and are removed from the screen after 2 s to coincide with taste delivery. A fixation cross is then presented for the remainder of the trial (4 s). The allocation of each stimulus to a given trial type is counterbalanced across subjects. There is a total of 280 trials in the experiment, 100 each of CS+ and CS− and 80 of CSneut. The whole experiment lasts a total of ~28.5 min. The first ten CS+ stimuli presented are paired on each occasion with reward (sweet taste), in 20 out of 90 subsequent CS+ presentations, the reward is omitted (CS+ omit). Further, in 20 presentations of the CS−, a reward is unexpectedly delivered (CS-unexpreward). The CSneut condition is primarily included to provide a low valence rinse for the glucose taste during the experiment. The presentation order is randomized and delivered to subjects using E-Prime (Psychology Software Tools, Pittsburgh, PA).

Fig. 1
figure 1_81

Schematic tasks application, modeled after O’Doherty et al. (2003). Subjects learned to associate a specific fractal with a particular taste. In 20% of cases, there was a mismatch of fractal and expected taste (“Sucrose fractal” but no taste delivery; “Nothing fractal” but Sucrose delivery). A total of 280 trials are applied; every 2.1 s a brain scan is acquired

6.3 Data Analysis

Image processing is done with the SPM 5 software package (http://www.fil.ion.ucl.ac.uk/spm/doc/spmbib.html). We performed whole brain analyses and also used region of interest-based analyses that include the VTA, ventral striatum (VS), amygdala, orbitofrontal cortex, and cingulate cortex for percent signal change extraction. There is a primary and a secondary analysis pathway in the paradigm. The primary analysis pathway identifies brain region activation in response to the conditioned fractal or unconditioned sweet taste application as well as brain response when taste is expected by not received. Main effect results are obtained for condition and subject group, as well as time activity curve data that illustrate brain activation over time in response to the individual stimulus.

The secondary analysis pathway involves the mathematical models that try to bridge brain function and actual behavior. As described earlier, computational modeling tries to explain a real-world problem in a simplified model that then gets translated into a mathematical equation that can be used to correlate with brain imaging results. This serves the purpose of predicting brain response following a stimulus, and depending on factors such as learning rate and received reward, different brain regions are implicated. That is, the subject performs a specific task or Reward Paradigm and learns under which conditions reward occurs. The time course of reward experience is known by the experimenter. In the taste experiment, reward delivery or omission is part of the paradigm and is recorded by the stimulus-applying computer. Those reward receipt or loss experiences are assigned a reward value in the computational model, and those reward values are then correlated with the functional magnetic resonance imaging brain response.

7 Use of Neurocomputational Models and Testing of Alternate Models (Fig. 2)

On each trial, the predicted value (V) at any time (t) within a trial is calculated as a linear product of the weights (w i ) and the presence or absence of a CS stimulus at time t, coded in the stimulus representation vector x i (t):

Fig. 2
figure 2_81

Principals of the Neurocomputational Modeling Approach. A study participant in the functional magnetic resonance brain imaging (fMRI) scanner performs the taste task. The blood oxygen level-dependent (BOLD) brain response is measured. The fist brain activation analysis tests in the whole brain activations that relate to the schedule of reward application and omission, reward expectation, etc. In the second analysis pathway, we use the individual reward task schedule data for each participant and import those data into a computer algorithm that computes the expected brain response based on the animal literature. Those outcomes are then regressed with the human in vivo data in order to (1) test whether significant regression results indeed occur in the expected brain regions and (2) make group comparisons whether controls have a stronger or weaker activation pattern to a particular learning rate than the anorexia nervosa group

$$ \hat{V}(t) = \sum\limits_i {{w_i}{x_i}(t){.}} $$

Learning occurs by updating the predicted value of each time point t in the trial by comparing the value at time t + 1 to that at time t, leading to a prediction error or δ(t):

$$ \delta (t) = r(t) + \gamma \hat{V}\left( {t + 1} \right) - \hat{V}(t){,} $$

where r(t) = reward at time t. The parameter γ is a discount factor, which determines the extent to which rewards arriving earlier are more important than rewards that arrive later on. Similar to O’Doherty’s study (O’Doherty et al. 2003), we set γ = 0.99. The weights w i are then updated on a trial-by-trial basis according to the correlation between prediction error and the stimulus representation:

$$ \Delta {w_i} - \alpha \sum\limits_t {{x_i}(t)\delta (t)} {,} $$

where α = learning rate. We assign six time points to each trial and use each subject’s individual event history as input. On each trial, the CS (visual fractal) is delivered at time point 1, and the UCS reward (sweet taste stimulus) is delivered at time point 3. Like O’Doherty (O’Doherty et al. 2003), we use as learning rate of the model (α) two values for α: a lower learning rate (α = 0.2) and a higher learning rate (α = 0.7). In the original study, the lower learning rate best modeled brain activation in the ventral striatum and orbitofrontal cortex. The temporal difference error δ(t) is created for the CS and the positive and negative UCS for each time point, and an ideal activation curve is created and convolved with the hemodynamic response curve. That curve is then modeled to the functional magnetic resonance imaging data in order to generate brain regions that respond according to the model. We cannot predict whether restricting-type anorexia nervosa women have simply reduced brain response to the stimuli or if a different learning rate explains altered learning response. This is an empirical task during data analysis. Reward prediction error: The TD error is calculated from the difference between the actual reward r(t) and the temporal difference of the estimated value function V(t). We vary the discount factor γ as 0, 0.3, 0.6, 0.8, 0.9, and 0.99: small γ for immediate reward prediction and large γ for long future reward prediction. We then use either V(t) or δ(t) as the explanatory variable in a regression analysis in the SPM brain imaging analysis program for correlation with brain activity.

8 Preliminary Results

We recruited 15 ill AN and 17 CW for this study. Both groups were similar in age, but AN individuals had higher scores on the Eating Disorders Inventory 3, Beck Depression Inventory, and lower body mass indices (BMI; data not shown). Between-group pleasantness and sweetness ratings were similar for both the Sucrose and Artificial Saliva Solutions.

Within groups, the unexpected receipt or omission of the sucrose taste stimulus activated dopamine-related brain regions such as the striatum, VTA, insula, and cingulate cortex. The whole brain group comparison indicated that unexpected receipt of the sweet taste stimulus activated brain dopamine regions more in AN than in controls, and unexpected omission of that stimulus caused more of a negative response in AN compared to the CW. These results suggest that brain taste-reward pathways linked to central dopamine transmission are functional but are overactive in ill AN compared to CW when receiving the stimulus unexpectedly, or during the omission of the taste. This response may thus suggest an increased temporal difference response in AN compared to CW. Such an over-responsiveness is consistent with clinical observations in EDs of heightened fear to novel or unexpected food stimuli, which results in stimulus avoidance and behavioral inflexibility.

In support of our biologic finding, a recent report showed that ill AN are hyper-sensitive to both rewarding as well as punishing stimuli (Jappe et al. 2010).

Furthermore, we imported the subject-specific reward schedule data (reward schedule during the brain imaging procedures) into a computer simulation and generated preliminary data for the VTA and AVS. As expected, the regression of the outcome activation data regressed in both CW and AN individuals significantly not only with VTA and AVS but also the insula and parts of the orbitofrontal cortex. This suggests that this dopamine-proposed mechanism involves other higher order function brain regions.

9 Limitations of the Model

The use of model-based fMRI is still in its infancy in psychiatry and there is still a lot to be learned. Several questions are still debated. The exact relationship between blood oxygen level dependent (BOLD) and brain function is not clear, such as the relationship of BOLD to action or response for instance. Evidence suggests that BOLD may reflect local processing of incoming stimuli rather than neuronal outflow to other brain areas (Logothetis 2002). Another key issue here is whether the BOLD fMRI does, in fact, reflect the proposed dopamine brain activation. The TD model used in our studies is dopamine based (Schultz 2002) and O’Doherty adapted the model to the fMRI environment (O’Doherty et al. 2003). Another carefully designed study showed that the VTA also can reflect dopamine activation when using the appropriate tasks (D’Ardenne et al. 2008). While results suggest dopamine is responsible for the BOLD response, the mechanism for such a relationship is not clear. Recent studies, however, indicate that BOLD fMRI in fact does reflect dopamine activation. Knutson for instance proposed that dopamine nucleus accumbens activation would stimulate dopamine D1 receptors and activate the BOLD signal via membrane potential changes (Knutson and Gibbs 2007). In summary, while there is still uncertainty about the exact mechanisms that determine BOLD response, increasing evidence indicates that DA function directly relates to BOLD fMRI response.

Another issue with our underlying dopamine concept is the number of brain regions involved in the model. The classic TD model focuses on the VTA and AVS and does not take other areas into account. This is a rather reductionistic model considering that emotional and cognitive processes, like learning and associations, impact the reward system activation. In response, O’Reilly created an extension to that model that takes into account the amygdala, hypothalamus, and orbitofrontal cortex (O’Reilly et al. 2007; Hazy et al. 2010). We are currently adapting computer models such as the Pavlovian Primary Reward/learned Reward model (O’Reilly et al. 2007; Hazy et al. 2010) to our imaging studies.

10 Future Outlook

We are still at the beginning of this line of research. However, results are encouraging and support the notion that we can actually study neurotransmitter-related brain pathways and identify patterns that are disorder related. This is an important step forward in the use of brain imaging, linking specific neurobiology in a psychiatric population. Such neurobiologic alterations could be used to improve treatment. For instance, making treatment very predictable may help ameliorate the dopamine hyper-responsiveness in AN, and we might be able to target specific problem behaviors with medication moderating dopamine activity.