Introduction

Over the last decade, retinoic acid (RA), a bioactive metabolite of vitamin A, emerged as an important regulator of brain development and functions. Signaling by RA is mediated by its binding to nuclear receptors (RARα, β, γ), which form heterodimers with retinoid X receptors (RXRα, β, γ) and act as ligand-controlled transcription factors. Several lines of evidence indicate that RA signaling is particularly important for functions of the striatum, the brain region critically involved in control of several functions including motor control, cognition, reward, and motivation. Accordingly, among different brain regions, the adult rodent striatum contains some of the highest levels of RA [1]. The striatum is also a site of strong expression of two retinoid receptors, RARβ and RXRγ [2, 3]. Genetic ablation of RARβ and/or RXRγ leads to abnormal striatal functions, revealed by deficits in motor coordination and depressive-like behaviors [4, 5]. Whereas some of these phenotypic abnormalities may have a developmental origin ([6, 7]), post-natal functions of these receptors have also been documented on evidence of RXRγ-dependent control of affective behaviors and dopamine D2 receptor (DRD2) signaling in the nucleus accumbens shell (NAcSh) [5, 8]. Despite these studies, our understanding of RA-dependent control of striatal functions is limited, due to the scarcity of knowledge about the transcriptional targets of RA signaling in the brain, which until now were mostly suggested by in vitro studies performed on different types of cultured cells and for few genes also validated in selected brain regions (for review see [9, 10]). To date, the best characterized RA-targets are the RARβ (Rarb) gene itself [11, 12] and the dopamine D2 receptor (Drd2) gene [4, 13].

Identification of mechanisms of RA signaling in the striatum should have direct relevance for understanding of pathophysiology of Huntington’s disease (HD), Parkinson’s disease (PD), schizophrenia, or depression, which are all associated with striatal dysfunction [14]. Importantly, compromised RA signaling due to reduced expression or activity of RA-synthesizing enzymes was documented for retinaldehyde dehydrogenase (RALDH) 1 in PD [15] or RALDH2 in Alzheimer’s disease (AD) [1618]. A causal relationship between compromised RALDH1 or RALDH2 activity and these diseases has also been suggested [16, 19]. Reduced expression of retinoid receptors in the striatum was reported in HD, a disease which severely affects striatal functions and is caused by expansion of polyglutamine repeats in the huntingtin (HTT) protein, leading to abnormal HTT aggregates. Specifically, reduced RARβ and RXRγ transcript levels were observed in the striatum from HD patients [20], whereas in the R6/2 transgenic mouse model of HD only a reduction of RXRγ mRNA was reported [21]. It is therefore tempting to hypothesize that an overall reduction of RA bioavailability or reduced expression and signaling of RARs in the striatum may constitute a mechanistic link between common symptoms of HD, PD, and AD, which all eventually show atrophy or neurodegeneration of ventral striatum and associated psychiatric symptoms.

We report here the first genome-wide analysis of RAR targets in the brain. Through genome-wide mapping of RARβ binding sites in the striatum enriched for ventral region and determination of transcriptome changes occurring in the ventral striatum (nucleus accumbens shell; NAcSh) after genetic ablation of RARβ, we distinguish bona fide (genes bearing RARβ binding sites, which expression is changed in RARβ−/− NAcSh), potential (genes bearing RARβ binding sites, but which expression is unchanged in RARβ−/− NAcSh), and presumably indirect targets (with altered expression in RARβ−/− NAcSh, but without RARβ binding sites) of RARβ. Functional annotations of those genes reveal exceptionally high links between compromised RA signaling and HD, and point to deficient signaling through G-protein coupled receptors (GPCRs), cAMP, and Ca2+ as a molecular link between compromised RARβ expression and some common histopathological and clinical symptoms of HD, PD, and AD. In support of this hypothesis, we found that ventral striatum, the region affected in all of these pathologies, is particularly prone to deficits in RA signaling. We also provide evidence that reduced RARβ expression in HD may result from partial sequestration of RARβ in aggregates of mutant HTT protein, which we characterized in the striatum of R6/2 transgenic mice.

Results

Genome-wide Identification of RARβ Binding Sites in Mouse Striatum Reveals Its Potential Transcriptional Targets and Suggests Implication of Retinoid Signaling in Neurological Disorders

To gain insight into transcriptional regulations by RARβ in the brain, we first investigated genome-wide distribution of RARβ binding sites in mouse striatum. For this purpose, we set-up conditions for efficient chromatin immunoprecipitation (ChIP) using striatum enriched for its ventral part. To test the specificity of ChIP, we performed a series of ChIP-qPCR analyses of a DNA region containing a known RA-response element (RARE) within the promoter of the RARβ gene [11, 12]. Accordingly, we demonstrated high (14-fold) enrichment of this region when compared to DNA region not containing RARβ binding sites [22], located -1203 to -1059 base pairs (bp) upstream of the RARβ transcription start site (TSS) [22] (Fig. 1a, left panel). Similarly, a 10-fold enrichment of RARE-containing region was observed after using anti-RARβ antibody as compared to a non-specific antibody directed against GFP (Fig. 1a, compare left and middle panels). The RARE-containing region was also highly enriched for histone 3 lysine 4 trimethylation (H3K4me3; Fig. 1a), which is associated with transcriptionally active or poised genes.

Fig. 1
figure 1

Validation of RARβ ChIP and ChIP-seq data for the mouse striatum. a Quantification (qPCR amplification) of ChIP fragments of the region of the RARβ gene promoter containing a “direct repeat 5” (DR5) RARE (RARβ DR5, black bars) and another, far-upstream region (RARβ -1000; gray bars). FC, fold change. b University of California at Santa Cruz (UCSC) web browser view of sequence tag density in.wig file format of the RARβ- and H3K4me3-occupied sites of the RARβ gene locus. Sequence tag density of GFP used as negative control is also shown

Using the same biological material, we then performed ChIP followed by high-throughput sequencing (ChIP-seq), which revealed 8075 RARβ binding sites when compared to ChIP-seq with anti-GFP antibody used as a negative control. Using the GPAT software [23], these binding sites were annotated to 5466 Ensembl transcripts, which corresponded to 4607 genes. As expected, among binding sites determined by ChIP-seq analysis we found the known RARE within the RARβ gene promoter (Fig. 1b). Functional annotations carried out with the Database for Annotation, Visualization, and Integrated Discovery (DAVID) [24] and Genomic Regions Enrichment of Annotations Tool (GREAT) [25] pointed to protein modifications, intracellular signaling cascades, regulation of small GTPase-mediated signal transduction, synaptic transmission/neurotransmitter release, cytoskeleton organization, and cell motility (p = 10e−14–10e−5), as the main biological processes associated with genes bearing RARβ binding sites. When analyzed using Ingenuity Pathway Analysis (IPA), such potential RARβ target genes revealed remarkably high association with neurological diseases including, by order of statistical significance, Huntington’s disease (p = 1.15e−28), schizophrenia (p = 1.09e−14), AD (p = 1.87e−7), and PD (p = 8.32e−7). Interestingly, both IPA and DAVID analyses revealed that 287 genes bearing RARβ binding sites were associated with HD. These genes (Table 1, Supplementary Table S1) are primarily involved in control of mitochondrial functions including oxidative phosphorylation, e.g., different subunits of ATP synthase, cytochrome c oxidases (Cox), NADH dehydrogenase, succinate dehydrogenase complex subunits (Sdha and Sdhc), and general mitochondrial functions controlled for example by voltage-dependent anion channels (Vdac 12). This suggests that abnormal mitochondrial functions under conditions of altered RARβ signaling may underlie common aspects of different neurodegenerative diseases. Accordingly, 64 out of 287 genes associated with HD were also reported to play a role in AD and/or PD pathophysiology (Supplementary Table S1). However, the majority of genes (223 genes) bearing RARβ binding sites were associated exclusively with HD, and not with AD or PD. Those genes are strongly linked to transcriptional regulation (52 genes, see Supplementary Table S2) and include two subunits of RNA polymerase II (Polr2a, Polr2i), transcription factors (e.g., Meis2, Sp1, Notch2, Foxp1, Nr1d1), and cofactors (e.g., Ppargc1b, Rcor1), or chromatin-modifying enzymes (e.g., Hdac2, Kdm3a). Furthermore, this “HD-specific” pool of genes was also enriched for GPCRs (e.g., dopaminergic receptors Drd1, Drd2, Drd3, cannabinoid receptor Cnr1, or cholinergic receptors Chrm1, Chrm4) and proteins involved in GPCR signal transduction, including components of the cAMP signaling pathway or linked to calcium signaling (Table 1, Supplementary Table S1). Abnormal functions of these signaling pathways may be relevant for psychiatric symptoms, including psychosis and depression, clinical symptoms frequently encountered in HD, and for other neurodegenerative diseases or psychiatric disorders. Accordingly, among 287 potential RARβ targets associated with HD, 40 genes are also related to pathophysiology of schizophrenia (Supplementary Table S1).

Table 1 Top 10 pathways (IPA analysis) related to genes containing at least one RARβ binding site and reported to be affected in HD

In addition to IPA- and DAVID-/GREAT-driven data mining, we compared our mouse ChIP-seq data with transcriptomic changes reported in caudate nucleus of HD patients [20]. Using this approach, we found that 678 genes containing RARβ binding sites displayed abnormal expression in human HD caudate samples (Supplementary Table S3), corresponding to 15 % of all transcripts found to be differentially expressed in the striatum of HD patients and such overlap was statistically significant (p = 3.135e−5) as revealed using hypergeometric distribution analyses. Although this comparison does not take into account interspecies conservation of binding sites, such a high overlap supports the idea of a role of abnormal RARβ signaling in pathophysiology of HD.

Loss of RARβ Function Leads to HD-Like Transcriptional Changes and Indicates Direct Targets of RARβ in Mouse Striatum

Transcriptome Alterations in RARβ−/− NAcSh

The identification of RARβ binding sites by ChIP-seq analysis pointed to a large pool of genes which transcription may be controlled by RARβ. However, as suggested by previous studies [22, 26, 27], only a fraction of RAREs may be functional in physiological conditions, whereas the activity of others may become evident in pathological, pharmacological, or yet other specific conditions. Analysis of transcriptional changes in mice carrying a null mutation of RARβ provides a useful means to indicate functional binding sites, and to provide stronger candidates for RARβ targets. We performed such transcriptomic analysis on the ventral striatum (NAcSh), the region affected in HD [28, 29], and the dysfunction of which may contribute to the three main aspects of HD pathophysiology, i.e., motor, cognitive, and affective abnormalities. Using Affymetrix GeneChip Mouse Gene 1.0 ST arrays, we identified 442 up-regulated and 614 down-regulated transcripts in the NAcSh of RARβ−/− mice (Fig. 2a, b). Such changes were significant as confirmed by Gene Set Enrichment Analysis (GSEA) analyses. IPA and DAVID analyses revealed that primary molecular and cellular functions affected by ablation of RARβ include cellular communication and development, carbohydrate metabolism, molecular transport, and small molecule biochemistry (Supplementary Table S4). Significant expression changes of a number of neurotransmitter receptors (e.g., Cnr1, Chrm4, Drd3, Htr1b), transporters (e.g., Slc17a7, Slc17a8, Slc5a7, Slc20a1), or metabolic enzymes (Pde10a, Pde11a, Pde4b) point to affected neurotransmission, synaptic signaling, metal ion transport, and metabolism of cyclic nucleotides. We also noted a deregulation of several genes encoding proteins involved in the control of calcium ion binding and signaling (e.g., Calb1 and 2, Cacna2d3, Cadherins, Actn2, Kcnip1). Analyses of signaling pathways revealed that the most significantly affected pathways are linked to G-protein signaling (cAMP-mediated signaling, Gαi, and GPCR signaling, which were among the top 3 canonical pathways affected by ablation of RARβ; Supplementary Table S5). Such transcriptional changes are relevant to neurological and psychiatric conditions, including schizophrenia (p = 2.57e−11), HD (p = 3.19e−8), and mood disorders (p = 3.89e−8). Specifically, 58 transcripts altered in the RARβ−/− NAcSh were reported by IPA analysis to be associated with HD pathophysiology (Table 2). The main functions affected by those alterations comprise calcium homeostasis, neurotransmission, G-protein signaling, and transcription. We compared our data with available transcriptomic data from caudate nucleus of HD patients [20]. Strikingly, 155 transcripts which expression was significantly affected (t test, p ≤ 0.05) in RARβ−/− NAcSh were also altered in HD (p = 0.037, revealed using the hypergeometric distribution), which corresponds to 15 % of transcriptional changes in the murine RARβ−/− striatum and 3.5 % of all transcriptional changes in the striatum of HD patients (Supplementary Table S6).

Fig. 2
figure 2

Transcriptional targets of RARβ. a Volcano plot representing transcriptional changes in the NAcSh of RARβ−/− mice. The fold changes (FC) of gene expression in RARβ−/− NAcSh were calculated with respect to WT NAcSh, and are illustrated on the horizontal axis using a logarithmic scale. Vertical axis represents the corresponding p values. The significance cutoff was set at p = 0.01, which corresponds to a false discovery rate (FDR) of 0.09. b Venn diagram representing the overlap between genes assigned with RARβ binding regions as determined by ChIP-seq, and genes altered in expression in the NAcSh of RARβ−/− mice, as determined by transcriptomics. c, d qPCR analysis of transcriptional targets of RARβ in the NAcSh and CPu, respectively. The genes selected for analysis correspond to transcripts with impaired expression in Huntington’s disease.*p < 0.05; **p < 0.01; ***p < 0.001

Table 2 Transcripts associated with HD pathophysiology and quantitatively affected in the NAcSh of RARβ−/− mice

Transcriptional Targets of RARβ in the Mouse Striatum

To further assess relevant RARβ targets, we compared our transcriptome data with genes assigned by ChIP-seq to contain at least one RARβ binding site. Among a total of 248 such genes, 103 were up-regulated and 145 were down-regulated (Fig. 2b; Supplementary Table S7). As expected, several genes known as direct transcriptional targets of RA were enriched in this group, including Stra6, Dhrs3, and Chrm4, but there were many genes not known so far for being RA-regulated. In order to gain insight into the functional consequences of ablation of RARβ, we performed functional annotation of those genes. IPA and DAVID annotations indicated that neurotransmission and cellular morphology (including microtubule dynamics, organization of cytoskeleton, and neuritogenesis) were among the primary cell functions associated with RARβ transcriptional target genes (Table 3). Although regulation of GPCR signaling by RARs was previously reported on evidence of transcriptional control of Drd2 or Oprk1 [13, 30], we have now extended the list of RARβ transcriptional targets to other GPCRs (e.g., Chrm4, Gabrg3, Gpr88), and other specific components of G-protein signaling pathways like Rasd2, Rgs9, Pde10a, and Kcnk2. We found significant deregulation of modulators of calcium homeostasis, including down-regulation of transcripts encoding ion channels (Kcnip1, Cacna2d3), and abnormal expression of genes which products indirectly control intracellular Ca2+ signaling (Actn2, Strn, Nrgn, Scn4b) (Supplementary Table S7).

Table 3 Neurological diseases and functional annotations associated with 248 potential striatal direct targets of RARβ

In order to validate our transcriptomic data by qPCR, we chose 14 randomly selected genes associated with HD (Table 3), adding to this analysis Rxrγ and Drd2. These latter genes are known to be RA transcriptional targets which escaped our selection criteria for determination of RARβ binding sites in ChIP-seq (Rxrγ, BS at −25.4 kbp) and transcriptome fold change (Drd2, FC 0.81, p = 4.9e−5). With exception of Cnr1, we confirmed all transcriptional changes which were detected by microarray analysis in the NAcSh (Fig. 2c). The most prominent down-regulations were observed for Scn4b, Gpr88, Actn2, Tac1, Rasd2, Rgs9, Pde10a, Kcnk2, Cacna2d3, Gabrg3, Rxrγ, Drd2, and Kcnip1, and up-regulation of Nrgn and Synpr was confirmed in the RARβ−/− NAcSh (Fig. 2c). Such changes were consistent with those observed in the striatum from human HD patients or R6/2 mice [20, 21, 31], with the exception of increased Nrgn and Synpr expression, and unchanged levels of Cnr1 in RARβ−/− mice. Surprisingly, only few transcripts were also affected in the dorsal striatum (caudate putamen; CPu) of RARβ−/− mice (Fig. 2d). Indeed, only Gabrg3, Nrgn, and Synpr displayed similar magnitude of changes in CPu and in NAcSh, whereas Scn4b and Tac1 displayed 2-fold weaker changes in the CPu, and Cnr1 displayed an increased expression in the CPu, but not the NAcSh of RARβ−/− mice (compare Fig. 2c and d). Such differences suggest that RARβ signaling may be more sustained in the ventral striatum possibly due to higher availability of RA suggested by stronger expression of proteins involved in retinol transport and metabolism for example RBP1 (also known as CRBP1; [32]) and STRA6 [33].

RARβ Is Sequestered in Huntingtin Protein Aggregates in R6/2 Mouse Striatum

The strong reduction of RARβ transcripts observed in post-mortem caudate nucleus samples from HD patients [20] led to the hypothesis that reduced expression of this receptor and compromised RA signaling could contribute to the pathophysiology of HD. Whereas our analyses of RARβ transcriptional target genes support and elaborate this hypothesis, it is not clear whether and how reduction of RARβ receptor level is attained in HD. To address these points, we analyzed RARβ expression and its subcellular distribution in ventral and dorsal regions of the striatum of R6/2 transgenic mice, a widely employed model of Huntington’s disease [34]. Real-time quantitative PCR revealed a significant reduction of RARβ mRNA in early symptomatic R6/2 mice, in the NAcSh (2.85 ± 0.43 arbitrary units for wild-type (WT) and 1.32 ± 0.18 for R6/2, t test, p = 0.0004) and CPu (2.27 ± 0.22 arbitrary units for WT and 0.98 ± 0.18 for R6/2, t test, p = 0.002). RARβ expression was not affected prior to aggregate formation in the developing striatum of R6/2 mice at E18.5 (0.256 ± 0.02 arbitrary units for WT and 0.222 ± 0.06 for R6/2; ns). Using immunofluorescent detection, we observed abundant expression of RARβ in the CPu and NAcSh of adult WT mice, but this expression was decreased in R6/2 mice (Fig. 3a, b) and absent in RARβ−/− mice (Supplementary Fig. S1). Quantitative immunofluorescence analyses using the Imaris software revealed that the amount of RARβ protein was significantly (on average ~50 %) reduced in R6/2 mice in both CPu (t test, p = 2.36e−5) and NAcSh (t test, p = 3.46e−6), as illustrated by lower numbers of cells displaying strong expression of RARβ (Fig. 3c). To address the mechanism of such reduction, we investigated whether low RARβ expression may result from its sequestration within HTT aggregates in R6/2 mice. Analyses of RARβ distribution within the nuclei of individual cells in R6/2 striatum revealed about 1.75-times higher levels of RARβ signal in HTT aggregates, as compared to the other regions of the nuclei, and such enrichment was observed in both the CPu (t test, p = 3.4e−19) and NAcSh (t test, p = 8.1e−18) (Fig. 3b, d). That such a colocalization reflects partial sequestration of RARβ in HTT aggregates is also supported by absence of RARβ foci in R6/2 transgenic striatum prior to aggregate formation during prenatal stage of brain development (Supplementary Fig. S2). We also observed a significant reduction of cell density in the NAcSh (3253 ± 343 cells/mm2 for WT and 1998 ± 166 for R6/2), but not in the CPu (1743 ± 177 cells/mm2 for WT and 2225 ± 207 for R6/2), of R6/2 transgenic mice. Such region-selective cell decrease was supported by significant interaction between the genotype and striatal sub-region for cell counts in two-way ANOVA analyses (F [1, 32] = 12.8; p = 0.001).

Fig. 3
figure 3

Quantitative analyses of RARβ expression in the striatum of R6/2 mice. a, b Immunofluorescence detection of RARβ (red) and HTT (green) in coronal sections of the CPu in 9-week-old WT (upper panels) and R6/2 transgenic mice (lower panels). The region displayed is boxed in the scheme inserted in lower corner of a. The magnification of RARb and HTT colocalization was shown in the plane of HTT aggregate in the upper right corner of the b. DAPI-stained nuclei are shown in blue. Scale bar, 10 μm. c Quantitative analysis of RARβ expression is shown as number of cells displaying different intensities of RARβ signal in CPu and NAcSh of WT and R6/2 mice. d Intensity of RARβ labeling detected by immunofluorescence within HTT aggregates is compared to its levels in other regions of the nucleus, in CPu and NAcSh of R6/2 mice. CPu, caudate putamen; NAcSh, nucleus accumbens shell

Structural Analysis of RARβ Binding Sites

Our study provides the first in vivo data on RAR binding sites in the brain. We analyzed the genomic architecture of these binding sites. Using the Homer software [35], we found that binding sites were highly enriched in core promoters (±50 bp from the TSS) and proximal promoter regions (−300 to −50 bp), where they appeared 10 times more frequent than in distal promoter regions (−5000 to −300 bp) or gene body (+50 bp to transcription termination site) (Fig. 4a). However, whereas binding sites located within the promoter (core, proximal and distal) represented 24 % of all regions occupied by RARβ, the majority (55 %) of binding sites were mapped to the gene body and 21 % to intergenic locations (Fig. 4b). The distribution of RARβ binding sites in the promoter regions was similar to DNA occupation by H3K4me3 (Fig. 4c). SeqMINER clustering analyses of RARβ and H3K4me3 binding sites with respect to their distribution profiles and TSS proximity revealed that for the total of 6273 Ensembl transcripts (Homer genomic annotation), 3640 were located within ±5 kbp from the TSS and colocalized with H3K4me3 binding regions (Fig. 4d). For 1045 of those transcripts, RARβ and H3K4me3 occupied large DNA regions extending from the TSS towards gene body (cluster 1, Fig. 4d). More frequently (2595 transcripts), such an overlap was restricted to a narrow region localized in the vicinity of the TSS (cluster 2, Fig. 4d), whereas for 2633 transcripts there was no overlap between RARβ and H3K4me3 binding sites within ±5 kbp (cluster 3, Fig. 4d).

Fig. 4
figure 4

Characterization of striatal profiles of RARβ and H3K4me3 DNA-binding sites. a Genomic position of RARβ binding sites with respect to transcription start sites (TSS). b Genomic annotation for RARβ binding sites. Distribution of RARβ binding regions within the genome was set-up using Homer. The following location criteria were used based on the distance from TSS: core promoter ±50 bp, proximal promoter −300 to −50 bp, distal promoter −5 kbp to −300 bp, gene body +50 bp to transcription termination site (TTS), intergenic (variable lengths). c Genomic position of H3K4me3 binding sites with respect to TSS. d Comparative clustering of RARβ and H3K4me3 ChIP-seq data using seqMINER. Read densities were established as regions surrounding the set of TSS of mouse genes based on Ensembl (v.67). Tag densities from each ChIP-seq data set were collected in a window of 10 kbp encompassing the TSS. The collected values were subjected to k-means clustering coupled to linear normalization. The intensity of H3K4me3 was lowered 10-fold

Globally, the binding sites were composed of different forms of repeated sequences previously described as binding sites for RA receptors [3638], and characterized by the presence of two consensus half-sites (RGKTCA) with variable spacing and orientation. Our ChIP-seq analysis revealed that RARβ occupies a large repertoire of direct repeats (DR0–DR10), inverted repeats (IR0–IR10), and everted repeats (ER0–ER10) (Fig. 5). These three types of elements were previously reported from ChIP-seq analyses in murine embryonic stem cells and embryonal carcinoma F9 cells using a pan-RAR (recognizing all three RARs) antibody [39]. As expected, DRs were the most frequently encountered RAREs, representing 79 % out of 957 consensus binding elements (no mismatch from consensus) (Fig. 5a). Among 957 elements, DR0 were the most represented (188), followed closely by DR2 (174) and DR5 (172) (Fig. 5b). We also identified about 130 highly conserved IRs and a similar number of ERs, with IR0, ER8, and ER10 as the most frequently encountered (Fig. 5c, d). Often several different elements were present within a single peak corresponding to a RARβ occupied DNA region. As determined with the help of regulatory sequence analysis tools (RSAT) [40], 257 peaks also harbored consensus binding sites for estrogen-related receptors α and β (ESRRA and ESRRB).

Fig. 5
figure 5

Consensus RA-response elements (RAREs) in RARβ-bound loci. a Overview of the frequency of direct repeats (DR), inverted repeats (IR), and everted repeats (ER) of the consensus RARE half-site (RGKTCA) identified through our bioinformatics analysis of RARβ-bound loci. bd Distribution of DRs (b), IRs (c), and ERs (d) according to the spacing between half-sites (numbers from 0 to 10 refer to the number of base pairs separating the half-sites)

Among 248 RARβ transcriptional targets (bearing RARβ-binding sites and which expression was altered in RARβ−/− mice), only 64 (25 %) contained conserved DRs, IRs, and ERs. Since the population of highly conserved RAREs was poorly represented in this group of genes, we performed additional search for RAREs, allowing one mismatch in the RARE half-site consensus sequence. This led to the detection of a total of 833 motifs, which were attributed to 174 out of 248 genes (70 %). Despite an overall increase of RARE-like elements, suggesting a high flexibility of RARβ in DNA recognition, the distribution between different RARE subtypes was not remarkably affected, with the number of DRs decreased only from 70 to 60 % in favor of an increase of IRs and ERs (20 % each). When searching the pool of 248 genes for RAREs with consensus half-sites or allowing one mismatch, DR5, DR0, DR7, and DR2 were most represented motifs. A de novo motif search in RARβ-occupied loci that did not contain RAREs revealed the presence of one or more Sp1-binding motifs within 50 % of such RARβ-bound regions.

To gain insight into the mechanisms of RARβ-dependent transcriptional control, we searched for core promoter elements (CPE; for review see [41]) in the core promoter region (±50 bp from the TSS) of the 248 RARβ transcriptional targets. Surprisingly, when searching for eight known CPE consensus sequences (TATA box, BREu, BREd, Inr, DPE, DCE, MTE, or XCPE1), we did not find any TATA box, but mostly detected Inr (18 %), DPE (13 %), BREd (12 %), and BREu (7 %). Focusing these analyses on 29 genes associated with HD, we did not find enrichment of any specific CPE. However, we noticed that 25 out of these 29 genes belonged to cluster 1, which is characterized by a broad occupation pattern of H3K4me3 (Fig. 4d), in contrast with the full set of 248 genes which were distributed approximately equally across the three clusters.

Discussion

Retinoic acid is indispensable for normal development of many organs including the brain, but its role in the adult central nervous system (CNS) is poorly recognized. One of the central reasons could be that availability of RA is highly secured in the brain by homeostatic mechanisms involving its local synthesis and/or peripheral metabolism. Thus, manifestations of abnormal control of brain functions by the RA pathway may become apparent only in cases of extreme vitamin A deficiency or in pathological conditions. To uncover such functions, it is critical to identify molecular substrates of RA signaling, including direct transcriptional targets of its receptors.

RARβ Regulated Signaling Pathways and Cell Functions

Using chromatin immunoprecipitation followed by high throughput sequencing (ChIP-seq), we have identified genomic RARβ binding sites in the mouse striatum, the main domain of RARβ expression in the brain [2], and the key brain region involved in control of a number of psychomotor functions [14]. Among 4607 genes bearing at least one binding site, we identified 248 which transcription was altered in the striatum of RARβ−/− mice (as detected in our comparative transcriptomic analysis of the NAcSh of WT and RARβ−/− mice). Those genes are proposed as strong candidates for being direct transcriptional targets of RARβ. Their functional annotations point to GPCR signaling as one of the major pathways regulated by RARβ. In particular, we identified new transcriptional targets like Drd3, Gpr88, or cAMP catabolic enzymes (Pde10a, Pde4b, Pde1b), and we determined in vivo RARβ-binding sites for genes previously reported as being regulated by RA, for example Stra6, Cnr1, Chrm4, Drd2, or Nrgn [13, 39, 42, 43]. Analyzing the potential impact of such regulations on dopaminergic (DRD1- and DRD2-mediated) signaling can be particularly instructive about RARβ control of striatal functions, as cell type-specific expression of DRD1 and DRD2 identifies, respectively, striatonigral and striatopallidal GABAergic projection neurons, which constitute the two main striatal outputs pathways. Imbalanced signaling through these pathways has been suggested to underlie symptoms of several neuropsychiatric disorders including HD and PD, as well as drug addiction and depression (for review, see [14, 44]). RARβ may be required for the activity of both pathways, contributing at multiple levels to their balanced signaling. Our data indicate that RARβ deficiency may lead to compromised activities of DRD2-positive striatopallidal pathway, resulting from concomitant reduction of transcription of Drd2 and Pde10a, the latter coding for a cAMP-catabolizing enzyme (phosphodiesterase) functionally associated with DRD2 activities [45].

Among direct transcriptional targets of RARβ, we also found the gene encoding cannabinoid receptor 1 (Cnr1), which displayed the most prominent increase of expression among genes that were tested by qPCR in the CPu of RARβ−/− mice. A consequence of enhanced Cnr1 expression would be an inhibition of DRD1 and DRD2 signaling, as previously suggested [46], but such effect would be limited to the CPu as no significant increase of Cnr1 expression was noted in the NAcSh of mutant mice. An inhibitory effect of CNR1 on adenylate cyclase activity may contribute to control of DA receptor signaling, but more importantly may lead to an overall reduction of cAMP availability in the RARβ−/− striatum. Although the net effect of opposing expression changes of Cnr1 and Pde10a, Pde1b, and Pde4b on cAMP levels need to be addressed experimentally, we cannot exclude the possibility of cell type-specific changes in cAMP availability.

Functional annotations of direct transcriptional targets of RARβ also point to a multilevel control of calcium homeostasis and signaling, which in absence of RARβ may be impaired and possibly harmful to striatal neurons. Thus, reduced expression of Cacna2d3, the subunit of Ca2+ channel known to limit action potential-driven Ca2+ influx [47], may lead to increased intracellular Ca2+ levels following neuronal activation in RARβ−/− mice. Similarly, down-regulation of the Ca2+ sensor KCNIP1, a critical subunit of potassium Kv4 channel, may lead to prolonged neuronal depolarization periods during action potentials, which in turn would result in extended duration of activity-driven Ca2+ influx. In contrast, reduction of Actn2 and Rgs9 expression in RARβ−/− mice could act to counterbalance increased Ca2+ influx. Synaptic communication and plasticity could be further affected due to inefficient utilization and signaling of intracellular and synaptic Ca2+, which is controlled by STRN and NRGN, both being direct targets of RARβ transcriptional activity [48, 49].

RARβ and Central Nervous System Disorders

Reduced RA signaling has been associated with several neurodegenerative diseases including AD, HD, or PD. Our analyses provide evidence for a strong association between RARβ loss of function and pathophysiology of HD, and suggest mechanisms through which compromised RA signaling may contribute to other neurodegenerative diseases including AD and PD. A link between RARβ and HD was suggested by previous transcriptomic studies of HD patients [20] or mouse models [21, 31]. Here, we report that expression of RARβ is significantly reduced at transcript and protein level in early symptomatic R6/2 mice, a transgenic mouse model of HD. We also show that such a reduction may result from partial sequestration of RARβ in aggregates of mutant HTT protein, which may further compromise RARβ signaling. The mechanism of sequestration is not known; thus, we cannot exclude direct interactions of RARβ with mutant HTT, and/or indirect co-sequestration of RARβ with CBP and N-CoR, which are known to interact both, with RARs [50, 51] and with mutant HTT [52, 53].

Our genome-wide identification of RARβ binding sites allowed to establish a list of potential RARβ transcriptional targets in the striatum, which is highly enriched in genes which expression is affected in HD patients. Functional annotation of 287 such genes pointed to the possibility of a RARβ-dependent multilevel control of mitochondrial functions, including oxidative phosphorylation. Although RARβ-dependent regulation of those genes remains to be formally demonstrated, deficient control of mitochondrial functions may provide a mechanistic link between pathophysiology of HD, or other neurodegenerative diseases for which compromised RA signaling has been documented [1518, 21, 54], and which all show mitochondrial abnormalities (for review see [55]).

Among genes with RARβ binding sites, we also identified a number of transcriptional regulators which are specifically associated with HD, but not AD or PD. Such observation supports a hypothesis on the pathogenic mechanism of HD, which involves global deficits in activity of different transcription factors [56, 57]. One of such mechanisms is sequestration and reduced availability of several ubiquitous transcription factors like CBP, N-CoR, p53, Sp1, TAF4 (TAFII130), and TBP [52, 53, 5862]. Reduced RARβ levels in HD could also indirectly impact transcription in the striatum by affecting expression of RNA polymerase II subunits (Polr2a, Polr2i), several transcription factors (e.g., Meis2, Sp1, Notch2, Foxp1, Nr1d1) and cofactors (e.g., Ppargc1a, Rcor1), and chromatin-modifying enzymes (e.g., Hdac2, Kdm3a), all of which except Polr2a and Rcor1 are known to be impaired in HD [20, 6365]. RARβ could also indirectly impact transcription by direct regulation of Foxp1 expression, another transcription factor implicated in HD [66]: indeed, we found several indirect targets of RARβ known to be regulated by FOXP1 in striatal cells.

Further insight into RARβ effects on general transcription mechanisms comes from detailed analyses of RARβ binding sites in 29 genes which expression was affected in the RARβ−/− striatum as well as in HD. These analyses revealed that in HD-relevant transcriptional targets, the RARβ binding sites overlapped with broad peaks of H3K4me3 occupation, shown to ensure the precision and robustness of gene expression which should be particularly high in tissues where function of a given gene is critical [67]. Intriguingly, none of the HD-associated RARβ transcriptional targets contained a TATA box in their promoter, suggesting that transcriptional control by RARβ does not directly involve TBP, but possibly other TBP-associated factors (TAFs). It is tempting to speculate that TATA-less gene promoters could be particularly susceptible to compromised RARβ signaling, suggesting also that such promoters rarely used in studies of retinoid signaling may be in fact more suitable than TATA box-containing promoters for testing the functionality of RAREs.

A Dorso-ventral Gradient of Increasing Susceptibility to RARβ Signaling in the Striatum

Transcriptomic analysis of RARβ−/− mice revealed significant alterations in gene expression in the NAcSh, which were confirmed for a selected group of genes using qPCR. Strikingly, only few of such transcriptional changes were also observed in the CPu. Our data therefore indicate an important regional specificity in RA control of gene transcription, as there appears to be a clear dorso-ventral gradient of increasing transcriptional effects of RA signaling in the striatum. A high susceptibility of the NAcSh to altered RA signaling may be of relevance for understanding some aspects of cognitive and affective symptoms observed in neurodegenerative disorders associated with reduced RA signaling. In particular, compromised GPCRs signaling may underlie depressive-like symptoms in the context of neurodegeneration, and may contribute to the pathophysiology of some psychiatric diseases such as depression or schizophrenia [5, 6870]. RARβ-controlled signaling pathways revealed in our study indicate that reduced RA signaling may contribute to the atrophy of NAcSh, one of the important symptoms of HD, PD, and AD [28, 7173]. Two, non-mutually exclusive mechanisms explaining NAcSh atrophy may involve reduced neurite outgrowth and synaptogenesis resulting from reduced expression of Scn4b, known to control cell morphology in physiological and pathological conditions [74], and cell death resulting from impaired Ca2+ signaling (as discussed above) and abnormal mitochondrial functions.

Architecture of RARβ Binding Motifs

In addition to genomic and functional annotations of potential and bona fide RARβ target genes, our study is also informative about the mode of RARβ control of gene transcription in mouse brain in vivo. In agreement with data obtained from the analysis of mouse embryoid bodies or embryonal carcinoma F9 cells [39], we found that DR0, DR2, DR8, DR5, and IR0 were the most abundant putative RAREs, suggesting a higher preference of RARs binding to these motifs. In complement to those observations, we found DR1 as new frequently occupied motifs in the striatum. Identification of numerous DR8 (for instance, in the Drd2, Gprin3, Mapk4, Cdh24, or Myo5c genes) confirms that these previously unrecognized elements are a frequent signature of RAR binding in vivo, as first suggested by experiments performed on embryonic stem cells [39]. The functional relevance of a high representation of DR0 (for instance, in Cbfa2t3, Stac2, Rora, Scarb1, or Rnf144b) has been questioned, as this element was reported to be nonfunctional in vitro when placed in front of minimal promoter, possibly due to steric hindrance in binding of the RAR-RXR heterodimer [39]. However, DR0 elements may contribute to composite DRs, as suggested by Moutier and colleagues. This possibility is supported in our study by the observation of a frequent coexistence of multiple DRs including DR0, but also DR1, DR3, DR6, and DR10, within individual peaks corresponding to native RARβ DNA binding regions.

We found that 21 % of RARβ binding sites were located intergenically, whereas 24 % of these binding sites were located in the promoter regions with a very high probability (10 % of all RARβ binding sites) of being positioned in the core and proximal promoter. This observation encouraged us to examine the distribution of core elements in the promoters of direct transcriptional targets of RARβ. Intriguingly, for this pool of genes, we did not find any TATA box motifs in core promoters, but only putative Inr, DPE, and BRE motifs, suggesting that RARβ may contribute to differential regulation of distinct core promoter element activities. Such differential regulation has also been reported for negative cofactor 2 (NC2), which may function specifically to activate DPE- and suppress TATA-dependent core promoter activity [75]. Furthermore, DPE core elements may also be regulated by RARβ indirectly. Critical for such regulation could be RARβ interaction with Nuclear Receptor Co-Repressor (N-CoR), which by binding to TAF6 and TAF9 subunits of the TFIID complex [76] could act via DPE elements [77]. The differential association of RARβ with distinct core promoter elements might be an additional “check point” ensuring a high level of transcriptional accuracy and target gene specificity of RARβ transcriptional regulations.

Conclusions

We present here the first genome-wide analysis of RAR binding sites in the brain, and provide a compendium of potential transcriptional targets of RARβ, the major RAR in the striatum and an important regulator of mammalian development—also investigated as a possible tumor suppressor gene. Globally, our analyses point to a strong contribution of RARβ in controlling neurotransmission, energy metabolism, cell morphology and transcription, with a particular involvement of G-protein, cAMP, and calcium signaling (Fig. 6). These regulations may be of relevance for better understanding the pathophysiology of neurodegenerative diseases associated with compromised RA signaling and point to a potential neuroprotective activity of RARβ. Relevance of such findings for HD is further supported by reduction of RARβ expression in R6/2 transgenic mice resulting from partial sequestration of the receptor in HTT aggregates in this animal model of Huntington’s disease. Our data pave the way for future functional studies on a gene to gene basis to characterize RARβ regulation of transcriptional target genes in the context of striatal physiology and pathology.

Fig. 6
figure 6

Scheme of major functions affected by RARβ deficit in the striatum and impaired in Huntington’s disease

Materials and Methods

Animals

RARβ knockout (RARβ−/−) mice and their WT control littermates were raised on a mixed genetic background (C57BL/6J and 129SvEms/j) as described [78]. To obtain R6/2 transgenic mice animals, WT males on a C57BL/6J x CBA F1 background were crossed with WT females transplanted with ovaries from transgenic HD mice of the R6/2 strain, purchased from Jackson Laboratories (USA). All mice were housed in individually ventilated cages, type “MICE” (Charles River, France) in a 7 am–7 pm light/dark cycle. Food and water were freely available. All experiments were carried out in accordance with the European Community Council Directives of 24 November 1986 (86/609/EEC) and in compliance with the guidelines of CNRS and the French Agricultural and Forestry Ministry (decree 87848).

Chromatin Immunoprecipitation (ChIP) and Sequencing

Chromatin was prepared from four freshly dissected striata enriched for ventral part of striatum. Following cross-linking in 1 % paraformaldehyde (PFA) at room temperature (RT) for 12 min, glycine was added to a final concentration of 0.46 M and incubated for 10 min at RT. Samples were washed in cold PBS/PIC and homogenized in lysis buffer (50 mM Hepes K salt pH 7.9, 1 mM EDTA, 0.13 % Triton X-100, 0.1 % Na-deoxycholate, 0.75 % SDS, 1× PIC). Chromatin was sonicated for 10 min using a Covaris untrasonicator and centrifuged at 16,000 rpm for 5 min. The supernatant was used for subsequent ChIP using antibodies against RARβ, H3K4me3, or GFP (sc-552, Santa Cruz; ab8580, Abcam; ab290, Abcam, respectively). Each 200 μl chromatin sample was diluted 7.5× to a final concentration of 50 mM Hepes K salt pH = 7.9, 140 mM NaCl, 1 mM EDTA, 1 % Triton X-100, 0.1 % Na-deoxycholate, 0.1 % SDS, and 1× PIC. Diluted chromatin was precleared with 40 μl of ProtA beads (Millipore, 16-157) for 45 min at 4 °C, followed by overnight (o/n) incubation with 3 μg of anti-RARβ, -GFP, or -H3K4me3 antibodies at 4 °C. ProtA-coated beads were added for a 3-h incubation at 4 °C. The beads were washed two times with each of consecutive buffers at 4 °C: IP buffer, buffer A (50 mM Hepes pH 7.9, 500 mM NaCl, 1 mM EDTA, 1 % Triton X-100, 0.1 % Na-deoxycholate, 0.1 % SDS, 1× PIC), buffer B (20 mM Tris pH 8, 1 mM EDTA, 250 mM LiCl, 0.5 % NP40, 0.5 % Na-deoxycholate, 1× PIC), and TE. DNA-protein complexes were eluted from the beads, decrosslinked, and treated with proteinase K for 2 h at 43 °C. DNA fragments were purified using phenol-chloroform, precipitated, and analyzed by qPCR or used for sequencing.

ChIP-seq libraries were prepared using NEXTflex ChIP-Seq Kit (Bio Scientific) following the manufacturer’s protocol (V12.10) with some modifications. Briefly, 10 ng of ChIP enriched DNA was end repaired using T4 DNA polymerase, Klenow DNA polymerase, and T4 polynucleotide kinase, size selected, and cleaned-up using Agencourt AMPure XP beads (Beckman). A single “A” nucleotide was added to the 3′ ends of the blunt DNA fragments with the Klenow enzyme. The ends of the DNA fragments were ligated to double-stranded barcoded DNA adapters (NEXTflex ChIP-Seq Barcodes - 6, Bioo Scientific) using T4 DNA ligase. The ligated products were enriched by PCR and cleaned-up using Agencourt AMPure XP beads. Prior to analyses, DNA libraries were checked for quality and quantified using a 2100 Bioanalyzer (Agilent). The libraries were loaded in the flowcell at 8 pM concentration and clusters were generated using the Cbot and sequenced on an Illumina Hiseq2500 system as single-end 50 base reads following Illumina’s instructions.

Bioinformatical Analysis

Raw data were analyzed by the Illumina CASAVA 1.8.2 and aligned to the mm9 genome with Bowtie 0.12.7. Peak detection was performed using the MACS 1.4.2 software [79] under settings where an anti-GFP ChIP was used as a negative control. We used a default cutoff p value at 1e−05, no model build by MACS and a customized shiftsize as 108 bp was set to get an optimizing result. Furthermore, peaks were annotated using GPAT [23] with a window search of 20 kb. For the compatibility of ChIP-seq and transcriptome datasets, we used Homer [35] to annotate the peaks with respect to the coordinates of the beginning and end of Ensembl genes (release 67). A custom JAVA application was used to detect the frequency of DRs, IRs, and ERs in the total RARβ peaks. The same application was also used for the core promoter elements analysis. The core promoter sequences were selected for the transcripts with the closest RARβ peaks. Cluster comparison of ChIP-seq data sets was performed with seqMINER [80]. The statistical significance of transcriptomic changes in the striatum between WT and RARβ−/− animals was confirmed by performing GSEA [81].

Bootstrap Analysis

To verify the statistical significance of the obtained cluster 1, cluster 2, or cluster 3-bound transcript groups in Fig. 4d, we performed bootstrap statistical analyses. In these analyses, we used the total pool of 26,460 Ensembl genes. Next, we randomly selected 6273 genes in the Ensembl total pool. This random selection was then compared with the transcript lists corresponding to different clusters (1045 transcripts for cluster 1, 2595 transcripts for cluster 2, and 2633 transcripts for cluster 3) and the number of transcripts (IDs) belonging to the non-random experimental group was determined. We repeated this process of random selection and gene list crossings 10,000 times and represented the number of IDs and their observed frequencies as histograms (see corresponding Supplementary Fig. S3). For each transcript list, we computed an average (mean) and a standard deviation (sd) of the number of random matches. A z-score is computed as: z = (mean-expect)/sd, where “expect” is the number of expected interest genes. For the three clusters, we obtained a p value lesser than 1.0e−16. The p value represents the significance of the difference between the randomly found average and the experimental ID numbers.

Immunofluorescence

Brain samples were fresh frozen, and cryosections (14 μm) were collected and postfixed for 10 min in 4 % PFA, followed by washes in PBST (PBS/0.1 % Triton X-100), blocking with 10 % normal goat serum and incubation (o/n at 4 °C) with primary antibodies: anti-RARβ (sc-552), anti-hHTT recognizing amino acids 50–64 of human Huntingtin (2B4). Secondary antibodies conjugated with Alexa 555 and Alexa 488 fluorophores and DAPI were used for detection. For all experiments, four animals of each genotype were analyzed.

Confocal Microscopy

Images were obtained using a SP8 Leica inverted-based microscope with ×63 objective and with zoom factor 2.5. To perform quantitative fluorescent measurements of RARβ expression level, DPSS561 laser power was kept constant for all acquisitions (each Z-stack in WT and transgenic brain sections). Quantitative fluorescence measurements and colocalization studies were performed using the Imaris software by creating the mask of nuclei and mask of HTT aggregates, which were next used to calculate the mean intensity of RARβ signal in corresponding regions. Quantification was carried out on 20–40 cells/animal for each striatal subregion. Data were analyzed using two-way ANOVA with genotype and striatal region as two independent variables or by two-tailed, unpaired t test for post-hoc analyses or two group comparisons.

Transcriptome (DNA Microarray) Analysis

Dissection of the Nucleus Accumbens and RNA Extraction

Behaviorally naive RARβ−/− mice (n = 6) and their WT littermates (n = 5) were sacrificed by cervical dislocation at the age of 4 months. Brains were fresh frozen in Shandon Cryomatrix (Thermo Scientific) and kept at −80 °C until use. NAcSh was dissected bilaterally under a stereomicroscope (Leica), from three subsequent 300 μm cryosections using 0.5 mm corer. RNA was isolated using the RNeasy Micro Kit (Qiagen) and kept at −80 °C for further analysis.

Hybridization on Microarrays and Analysis

The quality of RNA was determined by capillary electrophoresis in a 2100 Bioanalyzer (Agilent). RNA was next used to synthesize sense-strand cDNA, which was further labeled, fragmented, and hybridized on Affymetrix GeneChip Mouse Gene 1.0 ST arrays. Raw microarray data were normalized using a log scale robust multi-array analysis (RMA) [82, 83] in the Partek Genomics Suite 6.5 software and subsequently subjected to principal components analysis (PCA) in order to assess samples distribution. Only transcripts which were found to be significantly expressed (hybridization signal value above 5.76—30th percentile of all expression values) were retained for further analyses. Differences in gene expression were evaluated for average intensity signal for WT and RARβ−/− groups and expressed as log2 of the ratio of RARβ−/− to WT value (fold change, FC). The statistical significance of gene expression was assigned using two-tailed, unpaired t test. Genes were considered to be significantly regulated if the FC of gene expression was 0.8 ≥ FC ≥ 1.2 at p value <0.05 (FDR = 0.2, Benjamini and Hochberg method). Using another statistical method FCROS [84], we found almost the same selection (>95 % of common IDs) with smaller error (10 %). Stringent threshold level (t test threshold = 0.01, FDR = 0.09) was used for the volcano plot and for gene validation. Gene functional annotation was performed using Ingenuity (t test threshold = 0.05). Additional analyses of an overlap between HD-deregulated genes reported in the literature and RARβ target genes were also performed by analyses of hypergeometric distribution using phyper from R software library.

Quantitative RT-PCR

Quantitative RT-PCR (qPCR) was performed on RNA samples used for microarray hybridization and from an additional group of four mice/genotype. cDNA was synthesized using QuantiTect Reverse Transcription Kit (Qiagen) according to the manufacturer’s protocol. The real-time qPCR reactions were performed in a LightCycler 480 (Roche) using gene-specific primers (Supplementary Table S8) and QuantiFast SYBR Green PCR Kit (Qiagen). The amount of transcripts was evaluated relatively to the expression level of the housekeeping gene acidic ribosomal phosphoprotein P0 (Rplp0 or 36B4). Statistical analysis was performed using two-tailed, unpaired t test.

Data Access

The data have been submitted to the NCBI Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo/) under accession No. GSE67829, and GSE67761.