Introduction

Astrocytes not only provide metabolic support and homeostatic control but are also increasingly recognized to play an active role in various traditionally neuron-centered cognitive processes, including learning and memory [1,2,3,4,5,6,7]. Genetic knockout and pharmacological manipulations of inositol 1,4,5-triphosphate receptors (IP3Rs) and other astrocytic G-protein-coupled receptors have uncovered essential roles of astrocytes in the modulation of cognition, including working memory (WM) [8, 9], hippocampus-dependent spatial memory [10, 11], emotional behaviors [12, 13], and sleep and arousal regulation [14]. Astrocytes are in close proximity to neurons and can sense local neural activity and brain-wide neuromodulatory inputs and contribute to these cognitive processes via bidirectional interactions between astrocytes and synapses as components of tripartite synapses [15,16,17]. Astrocytes also show characteristic Ca2+ activity in responses to sensory [18] and motor events [19] and evoke the Ca2+-dependent release of gliotransmitters [20,21,22]. Dysregulation of Ca2+ dynamics in vivo contributes to neurological diseases such as epilepsy and Alzheimer’s disease [23]. Understanding Ca2+ dynamics is critical to elucidating the role of astrocytes in physiological and pathological processes.

Astrocytic Ca2+ dynamics in vivo have been studied with two-photon microscopy and camera-based large-field imaging, but these studies are limited by imaging of cortical layers [24], by using anesthetized animals (which may alter neural responses) [25], and by using head-fixed awake mice, which introduce stress to alter astrocytic function [19, 26]. The exact dynamic role of astrocytes in these cognitive processes remains unclear due to the difficulties in directly and specifically recording the astrocytic Ca2+ dynamics in animals performing tasks. The recent development of genetically-encoded Ca2+ indicator (GECI)-based fiber photometry enables the recording of intracellular Ca2+ signal in defined cells of behaving (task-performing) animals and provides a new opportunity to correlate the astrocytic Ca2+ signal with specific cognitive processes [27,28,29]. Recent studies using GECIs have shown that the astrocytic Ca2+ signal can be triggered by sensory [18], odor [30], and visual [31] stimulation, as well as by locomotion and startle responses [32]. However, whether these sensory and neural inputs can trigger event-specific astrocytic Ca2+ patterns to participate in specific cognitive processes remains unknown. Specifically, it is unknown whether specific patterns of astrocytic Ca2+ dynamics may exist and be embedded in neural circuits in association with specific cognitive processes. If such a pattern exists, what information does it encode, and how does it specifically correlate with neuronal network activity?

WM is characterized by temporary storage and manipulation of information to instruct a behavioral response, and is the foundation of cognitive behaviors [33]. Previous studies on WM have mainly focused on the neurons or neuronal networks in control of WM processing [34,35,36]. Many pharmacological and genetic-knockout studies have also demonstrated the astrocytic modulation of WM [9, 37, 38]. For example, astrocytes can control WM by modulating their release of D-serine to “boost” the induction of long-term potentiation (LTP) [39], by providing astrocyte-derived L-lactate to enhance LTP as metabolic fuel for synaptic remodeling to influence rule-learning and the performance of WM [38, 40], and by cannabinoid receptor type-1 signaling via inhibition of transmitter release at GABAergic and glutamatergic terminals [8]. However, as these experimental manipulations of astrocyte activity result in long-term activity changes in astrocytes, whether specific patterns of astrocytic Ca2+ dynamics may exist in association with specific WM processing (at the time scale of seconds) remains unclear. In particular, WM processing consists of three distinct information-processing phases, encoding, maintenance, and retrieval, and each phase can be completed within a few seconds [41]. Indeed, the hippocampus → prefrontal cortex (PFC) neural pathway is critical for the encoding of spatial WM [42], while the PFC is essential for the maintenance of WM [36], and the thalamus and striatum are involved in the retrieval of WM [43]. To the best of our knowledge, there is no information on astrocytic Ca2+ dynamics during the distinct phases of WM, and, consequently, whether astrocytes temporally participate in these distinct phases of WM processing in real time remains unknown. Furthermore, emerging studies have shown the dynamic interactions between astrocytes and neurons in animals responding to different stimuli [30, 31, 44]. However, the question whether astrocytes can dynamically and temporally precisely interact with neurons at the population level during WM processing remains to be answered. Lastly, WM processing reportedly involves many brain regions, including the hippocampus (encoding phase), medial PFC (mPFC) (maintenance phase), and striatum (maintenance and retrieval phases) [45, 46], with the communications of neuronal networks among these WM-related regions [46, 47]. Whether there is communication, such as synchronization, of astrocytic Ca2+ networks among these WM-related regions [48, 49] is currently unclear.

Therefore, we designed experiments to elucidate the dynamic role of astrocytic Ca2+ at the population level in the distinct information processes of WM by coupling GECI-based fiber photometry with the T-maze-based delay-no-match-to-place WM task. While the mPFC is critical to the maintenance of WM, we focused on astrocytic Ca2+ in the hippocampus, since astrocytic Ca2+ is strongly triggered by sensory inputs, and this is most evident in the encoding and retrieval phases [50], and the hippocampus → cortex pathway is critical to the encoding of WM information processing [42]. Our analysis revealed the gating of astrocytic Ca2+ dynamics by sensory inputs and the internal state, a strong inter-locked and antagonistic relationship between the astrocytic and neuronal Ca2+ dynamics, and the synchronization of astrocytic Ca2+ signals responding to WM processing between the hippocampus, mPFC, and striatum. These findings support the existence of bidirectional and inter-locked communication between astrocytes and neurons at the population level to fine-tune the WM information processing.

Materials and Methods

Animals

The animal protocols were approved by the Institutional Ethics Committee for Animal Use in Research and Education at Wenzhou Medical University, China. Male C57BL/6 J mice aged 8 weeks were purchased from SPF Biotechnology Co., Ltd (Beijing, China), and housed at 24 ± 0.5 °C with a relative humidity of 60% ± 2% and under a 12-h light-dark cycle (lights on at 08:00). Except when food-restricted for behavioral training and testing, all mice were given ad libitum access to food and water.

Stereotaxic AAV Injection and Optic Fiber Implantation

The stereotaxic AAV injection and optic fiber implantation were performed as described previously [43]. Briefly, for the dual-color fiber photometry experiment, male C57BL/6J mice (aged 8–12 weeks) were intraperitoneal anesthetized with sodium pentobarbital (50 mg/kg, Boyun, Shanghai, China). Then they were injected with 300 nL of AAV2/9-shortGFAP-Gcamp6s and 300 nL of AAV2/9-hSyn-jRGECO1a simultaneously into the hippocampus [anteroposterior (AP), −2.4 mm from Bregma; mediolateral (ML), −1.75 mm; dorsoventral (DV), −1.3 mm] unilaterally. For the multichannel fiber photometry experiment, male C57BL/6J mice (aged 8–12 weeks) were simultaneously injected with 300 nL of AAV2/9-shortGFAP-Gcamp6(s) into the mPFC (AP, +1.96 mm; ML, −0.35 mm; DV, −1.65 mm), striatum (AP, +0.98 mm; ML, +1.60 mm; DV, −2.60 mm), and hippocampus (AP, −2.4 mm; ML, −1.75 mm; DV, −1.3 mm) unilaterally. The injection rate was 50 nL per min. Optical fibers (Hangzhou Newdoon Technology Co., Ltd, Hangzhou, China, outside diameter: 1.25 mm, core: 200 µm, numerical aperture: 0.37) were implanted 0.1 mm above the injection site. The viruses used in this study were from Obio Technology Co., Ltd (Shanghai, China) and had titers from 1.29 × 1013 to 1.69 × 1013 genome copies per mL. The behavioral tests were started 3–4 weeks after stereotaxic surgery to achieve sufficient virus expression.

T-maze-Based Delayed-Non-Match-to-Place (DNMTP) Task

The T-maze–based DNMTP task was conducted according to the protocol described previously [43]. Briefly, after expression of the AAV2/9 virus for ~3 weeks, mice were first subjected to food restriction for 3 days and then to habituation for another 3 days. The mice were then given 10 trials of training per day for the WM task. During the training and testing periods, the mice were also maintained under food restriction to keep their body weight at 85% of that before training. Each trial consisted of three phases: sample, delay, and choice phases, corresponding to the encoding, maintenance, and retrieval phases of WM, respectively. In the sample phase, one of the two reward arms was randomly blocked, and the mice ran for a food reward from the starting arm to the reward arm (encoding). In the delay phase, the mice were driven to return to the starting arm and maintain the information on the reward arm in the sample phase for a variable delay (such as 10 s) (maintenance). In the choice phase, the blocking wall was removed, and if the mice selected the previously closed arm to achieve a second food reward (retrieval), then this was considered to be a correct trial. However, if the mice selected the previously explored arm without achieving another reward, then this was recorded as a false trial. The final performance of the mice per day was the average correct rate of 10 trials. The mice underwent the training and testing of WM for 4 days to achieve > 80% accuracy in the task. After full acquisition of the task, the mice were then subjected to WM testing with an increasing delay time (from 10 to 20 s, 30 s, and 60 s) during the maintenance phase.

Fiber Photometry Recording and Ca2+ Signal Analysis

The dual-color and multichannel fiber photometry system (Thinker Tech Nanjing Bioscience Inc., Nanjing, China) and MatLab (Math Works, Natick, USA) data analysis with custom-written programs were applied according to the manufacturer’s instructions. In Ca2+ signal analysis during the WM task, we mainly captured the Ca2+ dynamics at three event-related time points: (A) when the mice reached the reward arm in the encoding phase; (B) when the mice returned to the start arm at the beginning of the maintenance phase; (C) when the mice reached the reward arm in the retrieval phase. The Ca2+ signal values (ΔF/F) were calculated as (F−F0)/F0, where F0 is the baseline Ca2+ signal when the mice stayed at the start arm or before running (2-s time window). Then, we recorded the Ca2+ signal ranging from 8 s before and 10 s after the events.

Histology and Fluorescence Imaging

Briefly, mice were deeply anesthetized and transcardially perfused with 4% paraformaldehyde after optic fiber recording. Then, the brains were removed, post-fixed in 4% paraformaldehyde for 4–6 h, and dehydrated in 30% sucrose for 2–3 days. Coronal sections (30 μm) around the fiber implantation sites were prepared and stained as described previously [43]. Briefly, the sections were blocked at room temperature for 1 h in 10% normal donkey serum, 1% bovine serum albumin, and 0.3% Triton X-100 in phosphate buffered saline (PBS) and incubated with primary antibodies overnight at 4°C (rabbit anti-NeuN, 1:400, Sigma, St. Louis, USA; mouse anti-S100β, 1:400, Sigma). The sections were rinsed in PBS, followed by incubation with the corresponding secondary antibodies (donkey anti-rabbit 488/594, 1:1000, Invitrogen, Carlsbad, USA and donkey anti-mouse 647, 1:1000, Invitrogen). Images were acquired with a fluorescence microscope (LSM 880; Carl Zeiss, Jena, Germany) using a 20× objective.

Statistical Analysis

The quantitative data are presented as the mean ± SEM, analyzed with GraphPad Prism 6 (San Diego, USA). All statistical tests were performed with IBM SPSS Statistics 25 (Chicago, USA). The quantitative differences between the Ca2+ patterns were compared using an independent two-sample t-test. The cross-correlation analysis between astrocytic and neuronal Ca2+ signals was done with MatLab software (Math Works). The level of statistical significance was set at P < 0.05.

Results

Three Astrocytic and Neuronal Ca2+ Peaks During the Encoding, Maintenance, and Retrieval Phases of WM

To investigate the real-time astrocytic and neuronal Ca2+ dynamics during distinct WM information processing, we coupled GECI-based dual-color fiber photometry with the T-maze-based DNMTP WM task to simultaneously monitor astrocytic and neuronal activity in the hippocampus of mice performing the WM task using a green astrocytic Ca2+ indicator (AAV2/9-shortGFAP-GCaMP6s) and a red neuronal Ca2+ indicator (AAV2/9-hSyn-jRGECO1a). The GCaMP6s and jRGECO1a immunofluorescence was specifically observed in the hippocampus (Fig. 1A). The Gcamp6s expression almost co-localized with the astrocytic marker S-100β (Fig. 1B, C) and the jRGECO1a expression pattern was similar to that of the neuronal marker NeuN (Fig. 1D, E), confirming the specific astrocytic and neuronal expression patterns of the two Ca2+ indicators.

Fig. 1
figure 1

Dual-color fiber photometry recording of astrocytic and neuronal Ca2+ signals in the hippocampus. A Schematic of dual-color fiber photometry to record astrocytic and neuronal Ca2+ signals by simultaneously expressing AAV2/9-shortGFAP-Gcamp6s (green) and AAV2/9-hSyn-jRGECO1a (red) in the hippocampus (scale bar, 50 μm). AAV2/9, adeno-associated virus type 2/9. B, C Representative images (B; scale bar, 50 μm) and statistics (C; mean ± SEM; n = 3 mice) showing the Ca2+ indicator Gcamp6s (green) is co-localized with the astrocytic marker S-100β (red) but minimally with the neuronal marker NeuN (red). D, E Representative images (D; scale bar, 50 μm) and statistics (E; mean ± SEM; n = 3 mice) showing the Ca2+ indicator jRGECO1a (red) is specifically co-localized with the neuronal marker NeuN (green) but not with the astrocytic marker S-100β (green). F Example traces of astrocytic (green) and neuronal Ca2+ (red) during behavior in the home cage and during habituation. G Averaged correlation coefficients of astrocytic and neuronal Ca2+ in the home cage and during habituation.

By analyzing the astrocytic and neuronal Ca2+ during the WM task, we observed that both astrocytic and neuronal Ca2+ exhibited specific patterns responding to the distinct phases of WM processing (Fig. 2A): (1) during the encoding phase, the astrocytic Ca2+ signal elevated when the mice approached the reward arm and peaked at the point of obtaining/retrieving the reward; the peak decreased immediately after obtaining the reward but started to increase again when the mice left the reward arm and returned to the start arm; (2) the signal peaked at the starting point of the maintenance phase when the mice arrived at the start arm and then immediately started to decrease and continued to decline (below the baseline) through 10–20 s of the maintenance phase; and (3) the pattern of the signal in the retrieval phase was mainly similar to that in the encoding phase. And we found the decay time of astrocytic Ca2+ in the encoding and retrieval phases was significantly shorter than that in maintenance phase (Fig. 2D, E).

Fig. 2
figure 2

Specific patterns of astrocytic and neuronal Ca2+ responses to WM processing. A Schematic of the T-maze-based DNMTP WM task, divided into encoding, maintenance, and retrieval phases. Astrocytic (green) and neuronal Ca2+ (red) signals are simultaneously recorded during these phases of WM (red dashed line, mouse arrives at the turning point; black dashed line, mouse arrives at the reward arm). B Hysteresis loops between neuronal and astrocytic Ca2+ activity, showing that the astrocytic activity increases following an increment in neuronal Ca2+ (red arrow) followed by the decrement of neuronal Ca2+ activity (green arrow). C Analysis of correlation coefficients between astrocytic and neuronal Ca2+ corresponding to the three transients above. D, E Astrocytic Ca2+ transients (D) and the decay time of Ca2+ signal (E) among the three phases of WM. F, G Neuronal Ca2+ transients (F) and the area under the curve (AUC) of Ca2+ activity (G) among the three phases of WM. The average traces of astrocytic Ca2+ and neuronal Ca2+ are each from 80 trials (8 mice). The shaded area represents SEM. ***P < 0.001; ****P < 0.0001; Tukey's multiple comparisons test.

Meanwhile, the neuronal Ca2+ signal exhibited distinct patterns during the three phases of WM (Fig. 2A): (1) during the encoding phase, the neuronal Ca2+ signal rose when the mice approached the turning point, peaked at the turning point, and decreased rapidly before reaching the reward arm; (2) during the maintenance phase, the signal remained at the baseline within 10–20 s; and (3) the signal in the retrieval phase was also similar to that of the encoding phase. We also found that the neuronal Ca2+ activity in maintenance phase (staying at the start arm) was significantly higher than that in encoding and retrieval phases (staying at the reward arm) (Fig. 2F, G). Thus, hippocampal astrocytes and neurons displayed distinct Ca2+ signal patterns during the three phases of WM, with astrocytic Ca2+ peaking at reward arms and neuronal Ca2+ peaking at the turning point during the encoding and retrieval phases and with astrocytic Ca2+ activity persistently decreasing and neuronal Ca2+ remaining constant during the maintenance phase.

We also analyzed the astrocytic and neuronal Ca2+ signals during high WM-loading periods with 20-s, 30-s, and 60-s delay times (Fig. 3). Both astrocytic and neuronal Ca2+ patterns in the encoding and retrieval phases were almost the same as in the 10-s delay time. With a prolonged delay phase, the astrocytic Ca2+ signal persistently decreased below the baseline for 20 s and then returned to the baseline level for the rest of delay time (up to 60 s). However, neuronal Ca2+ largely remained constant at the baseline level for ~ 30 s.

Fig. 3
figure 3

Dynamic transients of astrocytic and neuronal Ca2+ in the three phases of WM processing when the decay time is 20 s, 30 s, or 60 s. The average traces of astrocytic Ca2+ (green) and neuronal Ca2+ (red) are each from 80 trials (8 mice). The shaded area represents the SEM. The vertical scale bars represent 10% and 5% as indicated.

As astrocytic Ca2+ signals can be triggered by sensory stimulation, startle responses [18, 32], or locomotion [19, 51], it was difficult to separate the different components that trigger the specific patterns of astrocytic Ca2+ signals since the WM task involves multiple simultaneous inputs. However, when we recorded the astrocytic and neuronal Ca2+ signals simultaneously while the mice were in the home cage or during habituation in the T-maze, we did not observe any specific patterns in these two states (Fig. 1F, G).

Bidirectional and Inter-Locked Interaction Between Astrocytic and Neuronal Ca2+ During Information Processing of WM

We further analyzed the relationships between astrocytic and neuronal Ca2+ in the distinct phases of WM processing by constructing hysteresis loops (Fig. 2B). The hysteresis loop analysis revealed that neuronal Ca2+ increased initially and this was followed by the continuous elevation of astrocytic Ca2+, with an accompanying decrease in neuronal Ca2+. Lastly, neuronal Ca2+ increased again with the decay of astrocytic Ca2+. Furthermore, cross-correlation analysis indicated that astrocytes not only positively correlated to neurons with a delay of ~ 3 s, but also negatively correlated to neurons with a preceding time of ~3 s during the WM task (Fig. 2C). So, interestingly, both the delay and preceding times of astrocytic Ca2+ dynamics relative to neurons were ~ 3 s. In addition, the correlation analysis [52] of astrocytic and neuronal Ca2+ signals indicated a slight correlation during the home cage state and a weak positive correlation during habituation to the T-maze (Fig. 1G). Collectively, these analyses indicate bidirectional interactions and coordination between astrocytic and neuronal networks contributing to WM processing.

Rule Learning in WM Produces Plasticity Changes in the Astrocytic and Neuronal Ca2+ Dynamics

As the training proceeded, the WM performance improved by increasing the correct trial rate (Fig. 4A). To explore whether astrocytes are involved in modulating WM performance, we compared astrocytic Ca2+ activity between correct and wrong trials in the WM task (Fig. 4B). During the retrieval phase, astrocytic Ca2+ activity in the wrong trials was significantly higher than in the correct trials on day 3 (Fig. 4D) and day 4 (Fig. 4F). But during the encoding phase, astrocytic Ca2+ activity in the wrong trials was significantly weaker than in the correct trials on day 4 (Fig. 4E). Similarly, we also compared neuronal Ca2+ activity between the correct and wrong trials in the WM task (Fig. 4C). The results show that neuronal Ca2+ activity in the retrieval phase of the wrong trials was significantly higher than in the correct trials during day 2 (Fig. 4G) and day 3 (Fig. 4H). So, the main differences in astrocytic and neuronal Ca2+ signals between correct and wrong trials occurred on day 3 when the mice started to achieve good performance, suggesting a modulatory role of astrocytes in WM performance after rule learning.

Fig. 4
figure 4

Comparison of astrocytic and neuronal Ca2+ signals between the correct and wrong trials. A Performance of virus-injected mice in the T-maze-based WM task during the four days of training. B Averaged astrocytic Ca2+ transients in the correct trials (green) and wrong trials (orange) corresponding to the three phases of WM on each of the four training days (n = 8 mice; shaded area, SEM). C Averaged neuronal Ca2+ transients in the correct trials (red) and wrong trials (grey) corresponding to the three phases of WM during each of the 4 training days (n = 8 mice; shaded area, SEM). D–F AUC for astrocytic Ca2+ activity indicated in the dashed boxes in (B). G, H AUC for neuronal Ca2+ activity indicated in the dashed boxes in (C). The numbers of correct trials were 61 on day 1; 63 on day 2; 67 on day 3; 71 on day 4, and the numbers of wrong trials were 19 on day 1; 17 on day 2; 13 on day 3; 9 on day 4. *P < 0.05, paired-sample t-test. AUC, area under the curve.

To further determine whether astrocytic Ca2+ exhibits plasticity with rule learning in the WM task, we analyzed astrocytic Ca2+ transients in 4 training days. When WM performance increased after 4 training days (Fig. 4A), astrocytic Ca2+ activity exhibited plasticity changes (Fig. 5A). Astrocytic Ca2+ dynamics increased in the maintenance phase (Fig. 5C) and decreased in the retrieval phase (Fig. 5D) on training days 2–4 compared to day 1. Moreover, we found similar plasticity changes of neuronal Ca2+ activity at 4 training days (Fig. 5E): neuronal Ca2+ dynamics increased in the maintenance phase (Fig. 5G) and tended to decrease in the retrieval phase (Fig. 5H) on training days 2–4 compared to day 1. In the second part of the encoding phase, astrocytic Ca2+ activity increased (Fig. 5B) while neuronal Ca2+ dynamics decreased over 4 training days compared to day 1 (Fig. 5F). These results indicate that astrocytic and neuronal Ca2+ exhibit plasticity changes during WM processing.

Fig. 5
figure 5

The astrocytic Ca2+ signal exhibits plasticity with rulelearning during the WM task. A Astrocytic Ca2+ transients corresponding to the 3 phases of WM among the 4 training days. BD AUC for astrocytic Ca2+ activity indicated in the dashed boxes in (A). E Neuronal Ca2+ transients corresponding to the three phases of WM among the 4 training days. FH AUC for neuronal Ca2+ activity indicated in the dashed boxes in (E). *P<0.05, repeated measures one-way ANOVA.

Synchronization and Heterogeneity of Astrocytic Ca2+ Dynamics During Information Processing of WM in the Hippocampus, mPFC, and Striatum

To explore astrocytic Ca2+ signals in other WM-related brain regions, we applied multichannel fiber photometry to monitor the dynamics of astrocytic Ca2+ in the mPFC, striatum, and hippocampus simultaneously during the WM task (Fig. 6A). First, the astrocytic Ca2+ dynamics from the three regions were synchronized both during free movement (Fig. 6B) and performance of the WM task (Fig. 6D). Correlation of the dynamics between the hippocampus and mPFC, or between the hippocampus and striatum was > 0.6 (Fig. 6C), suggesting the synchronization of astrocytic Ca2+ among WM-related brain regions.

Fig. 6
figure 6

Synchronization and heterogeneity of astrocytic Ca2+ signals responding to WM processing in distinct brain regions. A Cartoon of multichannel fiber photometry to record astrocytic signals in the hippocampus, mPFC, and striatum, and the expression of AAV2/9-shortGFAP-Gcamp6s (green) in the three brain regions (scale bars, 1 mm). B Representative trials showing simultaneous astrocytic Ca2+ transients recorded in the hippocampus (red), mPFC (green), and striatum (blue) in freely-moving mice (vertical bars, 10% and 5%; horizontal bar, 20 s). C Correlation coefficients of astrocytic Ca2+ corresponding to WM processing the hippocampus and mPFC (left), and the hippocampus and the striatum (right). D Dynamic patterns of the averaged astrocytic Ca2+ signal in the hippocampus (red), mPFC (green), and striatum (blue) corresponding to distinct phases of WM (10 trials in 9 mice; shaded area, SEM). E AUC for astrocytic Ca2+ at different WM phases in distinct brain regions, as indicated in the dashed boxes in (D). HPC, hippocampus; mPFC, medial prefrontal cortex; STR, striatum. *P < 0.05, **P < 0.01, ***P < 0.001, Tukey's multiple comparisons test.

However, astrocytic Ca2+ in the three brain regions also displayed heterogeneity during WM processing (Fig. 6E): (1) during the encoding phase, astrocytic Ca2+ activity in the mPFC and hippocampus was significantly stronger than in the striatum; (2) during the maintenance phase, the activity in the hippocampus and striatum was stronger than in the mPFC; and (3) during the retrieval phase, the activity in the mPFC was higher than in the striatum. Taken together, these results not only demonstrated the existence of astrocytic Ca2+ synchronization in distinct WM-related regions but also showed the heterogeneity of the astrocytic Ca2+ dynamics in distinct brain regions during information processing of WM.

Discussion

One of the central issues is to identify specific astrocytic Ca2+ patterns in relation to the distinct phases of WM (such as encoding, maintenance, and retrieval), specific location in space, emotional state (reward experience/delivery), and memory acquisition (such as rule learning), such that the population Ca2+ dynamics in astrocytes contribute to real-time information processing of WM. Our online monitoring of the astrocytic and neuronal Ca2+ dynamics of animals performing a WM task provides the first analysis of the dynamic interaction of astrocytic and neuronal networks at the population Ca2+ level during distinct phases of WM. The most notable pattern of the astrocytic population Ca2+ signal was the three distinct Ca2+ peaks during the encoding and retrieval phases associated with the animal’s approach to the turning point in the T-maze (the decision-making point) and the inter-locked, antagonistic relationship with the neuronal Ca2+ dynamics at a characteristic 3-s phase difference during WM information processing.

Astrocyte Ca2+ Dynamics are Gated by Sensory Inputs and Modified by Reward Experience

The notable three astrocytic Ca2+ peaks in the encoding and retrieval phases were all spatially associated with the T-maze turning point; the onset of the astrocytic and neuronal Ca2+ peaks correlated with the animals’ approaching and returning to the turning point of the T-maze. While the neuronal Ca2+ signal peaked at the turning point location, astrocytic Ca2+ continued to increase until the animals reached the reward. The turning point may represent a (spatial) sensory input that evokes Ca2+ signals in astrocytes and neurons. Consistent with this, whisker stimulation triggers a transient increase in noradrenaline and, consequently, a transient Ca2+ rise in astrocytes in rodents [32], whereas repeated aversive stimulation produces sustained adrenergic release and increased levels of intracellular Ca2+ and cAMP in a large population of astrocytes [53]. The “place cells” in the hippocampus fire in unique sequences forming rhythmic activity such as theta oscillations when the animal is in specific locations, thus encoding critical spatial information [54]. It has been speculated that astrocytic Ca2+ responds to the “place cell” rhythmic activity and works in concert with neurons to encode spatial information during WM processing. Besides, delayed astrocyte activity can be triggered by a surge of neuronal network activity in the gamma range (30–50 Hz) induced by sensory stimuli [55]. Thus, the astrocytic Ca2+ dynamics may be triggered by neural activity of the place cells or neural rhythmic change of neurons in the hippocampus at this specific turning point of the T-maze. Alternatively, the turning point may also represent a decision-making point since the mice need to use specific spatial information to guide them to the reward at this point. Furthermore, these sensory-elicited astrocytic Ca2+ dynamics were further modified by the internal state of the animals, such as the reward experience. In particular, we noted that delivery of the reward in the encoding and retrieval phases was associated with a faster decrease in the Ca2+ than in the delay phase (without reward). Thus, sensory inputs (spatial information such as the turning point) and internal state dictate the astrocytic dynamics.

Furthermore, distinct from the encoding and retrieval phases, the astrocytic Ca2+ dynamics in the hippocampus started to decline immediately after the maintenance phase (in the absence of sensory input) and continued to decline below baseline for up to 20 s, but then remained at the baseline level for the rest of the prolonged delay time (from 20 to 60 s). Meanwhile, neuronal Ca2+ remained at the baseline level throughout the maintenance phase.

Entrainment of the Astrocytic and Neuronal Ca2+ Dynamics During Information Processing of WM (by Bidirectional and Antagonistic Interaction) with a Characteristic 3-s Phase Difference

The most notable Ca2+ patterns were that the astrocytic and neuronal Ca2+ dynamics at the population level (i.e., neural and astrocytic ensembles) exhibited inter-locked, antagonistic patterns during the encoding and retrieval phases. Both neuronal and astrocytic Ca2+ exhibited an initial rise as an animal approached the turning point of the T-maze, but neuronal Ca2+ started to decline after passing the turning point while astrocytic Ca2+ continued to rise. Astrocytes express a repertoire of receptors, transporters, and other molecules, enabling them to sense a multitude of synaptic mediators as well as cytokines. The continuing rise after the turning point may be explained by the synaptic release of neurotransmitters which can be sensed by astrocytes, contributing to the continued rise until reaching the reward. On the other hand, the continuous increase in astrocytic Ca2+ was closely associated with the decrease in neuronal Ca2+, resulting in the inter-locking of the astrocytic Ca2+ peak with the neuronal Ca2+ valley. The correlation analysis and hysteresis loop showed both positive and negative correlations between astrocytic and neuronal Ca2+ signals, supporting this bidirectional interaction (initially, neurons facilitate astrocytes and then astrocytes inhibit neurons). This antagonistic relationship between the rise of astrocytic Ca2+ and decrease of neuronal Ca2+ at the population level is in agreement with the similar antagonistic interaction of astrocytes and neurons in response to sensory stimuli [55]. This may involve the release of ATP/adenosine and GABA as gliotransmitters to inhibit synaptically-evoked firing at the surrounding synapses to modify neuronal plasticity [56, 57].

From the dynamic Ca2+ patterns, we recorded a neuronal peak at the turning point and an astrocytic peak at the reward delivery point, the neuronal peak preceding the astrocytic peak by ~2–3 s. Conversely, an astrocytic peak preceded a neuronal valley, also by ~2–3 s. These temporal relationships were supported by cross-correlation analysis showing that the phase-difference between the highest positive correlation of the neural and astrocytic Ca2+ dynamics as well as the highest negative correlation were both close to 3 s, probably due to the difference between the neuronal signal (15–100 ms) and the astrocytic signal (1–3 s) in response to sensory input [58]. This is consistent with a previous report showing that astrocytic visual responses are delayed by ~ 5 s relative to the surrounding neuronal responses [31]. This delay is apparently caused by the necessity for the total amount of integrated local synaptic activity to exceed a threshold to enable Ca2+ responses. On the other hand, the 3-s delay in neuronal Ca2+ in association with an increase in astrocytic Ca2+ reflects the time during which release of a gliotransmitter acts on neurons to produce an effect at the population level. This is consistent with the previous finding that optogenetic stimulation of astrocytes immediately evokes the blood oxygenation level-dependent signals representing neuronal activation [59]. Thus, neuron–astrocyte interaction at the population level is characterized by a second level of temporal relationship (average 2–3-s phase difference), likely involving the release of neurotransmitters and gliotransmitters in this bidirectional interaction.

Rule-Learning in WM is Associated with Reduced Astrocytic Ca2+ Signal During the Retrieval Phases

We also investigated whether learning can produce functional plasticity changes in astrocytic Ca2+ patterns by comparing the astrocytic Ca2+ dynamics during the correct versus wrong trials and by monitoring the dynamics after 4 days of training. The consistent difference between the correct and wrong trials was a weaker Ca2+ signal in the correct trials than in the wrong trials during the retrieval phase. Similarly, population astrocytic and neuronal Ca2+ signals during the retrieval phase were reduced on days 3 and 4 compared to day 1. These findings support the idea that rule learning in WM causes functional plasticity in the astrocytic Ca2+ dynamics. This decrease of the astrocytic Ca2+ dynamics may be a reduction (with modified tuning properties) of the same population or gradual reduction of other neural population activities that are not relevant to WM performance. This finding is consistent with a previous finding of an early expansion of the neuronal populations, followed by a reduction into smaller neuronal population at the proficiency stage during motor and neuroprosthetic learning [60].

Furthermore, rule learning in WM over four days also produced a gradual increase of astrocytic and neuronal Ca2+ signals during the maintenance phase of WM. The functional significance of these plasticity changes remains unclear as the hippocampal Ca2+ dynamics lack persistent activation during the maintenance in contrast to the PFC. The functional plasticity changes in the hippocampal astrocytic Ca2+ dynamics support the involvement of astrocyte population Ca2+ signal in the control of WM. This view is consistent with previous findings that astrocytes control WM by modulating the astrocytic release of D-serine to “boost” LTP induction [39], by the astrocyte-derived molecule, L-lactate, that is used as a metabolic fuel to enhance LTP at the hippocampal CA1 synapse [38], and by cannabinoid receptor type-1 signaling via inhibition of transmitter release [8]. Astrocytes may also mediate the effects of vigilance and arousal on memory consolidation and performance.

Synchronization and Heterogeneity of Astrocytic Ca2+ Signals Responding to WM Processing in Distinct Brain Regions

Synchronization of different brain areas is essential for cognitive performance and sensory perception and is regulated by behavioral demand [47]. The mPFC is considered to be critical to WM processing through persistent neuronal activation during the maintenance phase [36]. Increasing evidence supports the existence of neuronal network interactions between the mPFC and hippocampus during WM processing [42, 45, 47, 61, 62]. These two regions are interconnected by dense monosynaptic and polysynaptic interconnections, and are strongly neurally synchronized [63]. Notably, their synchronization (theta oscillations) is enhanced during the delay and choice periods of WM [64,65,66]. However, little is known about the dynamic interaction of the astrocytic network between the mPFC and hippocampus during WM processing. In the present study, we showed for the first time the synchronization of population astrocytic responses among WM-related brain regions, especially between the mPFC and hippocampus. We discovered that astrocytic Ca2+ activity in the mPFC and hippocampus was closely parallel and locked in phase during the encoding and retrieval phases (especially at the decision-making point), but not at other points (see Fig. 6D), suggesting strong synchronization of the astrocytic network between the two brain regions during WM processing (rather than synchronizing with motor activity which displayed non-phasic pattern). Given the rapid propagation of astrocytic Ca2+ waves, this synchronization of the Ca2+ dynamics in the mPFC and hippocampus while performing the WM task [42, 61, 67] may be explained by a coordination through astrocytic Ca2+ network synchronization.

We also demonstrated the heterogeneity of the dynamic characteristics of astrocytic Ca2+ in distinct regions responding to WM processing. Basal astrocytic Ca2+ activity in the striatum was much lower than in the mPFC and hippocampus, consistent with a previous study reporting distinct Ca2+ homeostasis in striatal and hippocampal astrocytes [48]. These differences in basal astrocytic Ca2+ may be attributed to a difference in Ca2+ entry into the cell through Ca2+-permeable receptors, Ca2+ channels, and Na+/Ca2+ exchangers, as well as Ca2+ passing through IP3Rs on the endoplasmic reticulum and through the mitochondrial permeability transition pore.

Collectively, our analysis revealed that the dynamics of astrocytic Ca2+ are gated not only by sensory inputs (centering at the turning point of the T-maze) but also by the internal state (delivery of the reward) during the sample and choice phases. Importantly, our recordings revealed a strong inter-locked and antagonistic relationship between the astrocytic and neuronal Ca2+ dynamics, in which increased neural inputs may trigger astrocyte Ca2+ increase, which in turn constrains neuronal activity during WM information processing. This inter-locked relationship between the astrocytic and neuronal Ca2+ dynamics during WM information processing is characterized by a temporal pattern with a 3-s phase difference. Furthermore, rule-learning during WM training causes functional plasticity changes in astrocytic Ca2+ with a reduced signal during the retrieval phases. Last, we uncovered the synchronization and heterogeneity of astrocytic Ca2+ signals responding to WM processing between the hippocampus, mPFC, and striatum. These findings support the existence of bidirectional and inter-locked communication between astrocytes and neurons at the population level during WM information processing, and they work in concert to fine-tune the dynamic range of the stimulus-response relationship according to the sensory inputs and WM performance. As current application of optogenetics to astrocytes usually does not produce a temporally precise activation of astrocytes (ranging from 10–30 s [68, 69] to 5–15 min [1, 70]), this hampers our ability to dissect a causal role of astrocytes in the distinct phases of WM information processing (i.e., within a few seconds). Future studies are warranted to validate the function of these astrocytic Ca2+ patterns by refined optogenetic manipulation of the astrocytic Ca2+ dynamics or replay of astrocytic Ca2+ patterns.