Structural organization of the inactive X chromosome in the mouse

Giorgetti, Luca; Lajoie, Bryan R.; Carter, Ava C.; Attia, Mikael; Zhan, Ye; Xu, Jin; Chen, Chong Jian; Kaplan, Noam; Chang, Howard Y.; Heard, Edith; Dekker, Job

doi:10.1038/nature18589

Structural organization of the inactive X chromosome in the mouse

Letter
Published: 18 July 2016

Volume 535, pages 575–579, (2016)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

From

View current issue Submit your manuscript

Structural organization of the inactive X chromosome in the mouse

Download PDF

Luca Giorgetti¹^na1^nAff7,
Bryan R. Lajoie²^na1,
Ava C. Carter³^na1,
Mikael Attia¹^na1,
Ye Zhan²,
Jin Xu³,
Chong Jian Chen¹,
Noam Kaplan²,
Howard Y. Chang³,
Edith Heard^1,4 &
…
Job Dekker^2,5

31k Accesses
272 Citations
148 Altmetric
16 Mentions
Explore all metrics

Abstract

X-chromosome inactivation (XCI) involves major reorganization of the X chromosome as it becomes silent and heterochromatic. During female mammalian development, XCI is triggered by upregulation of the non-coding Xist RNA from one of the two X chromosomes. Xist coats the chromosome in cis and induces silencing of almost all genes via its A-repeat region^1,2, although some genes (constitutive escapees) avoid silencing in most cell types, and others (facultative escapees) escape XCI only in specific contexts³. A role for Xist in organizing the inactive X (Xi) chromosome has been proposed^4,5,6. Recent chromosome conformation capture approaches have revealed global loss of local structure on the Xi chromosome and formation of large mega-domains, separated by a region containing the DXZ4 macrosatellite^7,8,9,10. However, the molecular architecture of the Xi chromosome, in both the silent and expressed regions, remains unclear. Here we investigate the structure, chromatin accessibility and expression status of the mouse Xi chromosome in highly polymorphic clonal neural progenitors (NPCs) and embryonic stem cells. We demonstrate a crucial role for Xist and the DXZ4-containing boundary in shaping Xi chromosome structure using allele-specific genome-wide chromosome conformation capture (Hi-C) analysis, an assay for transposase-accessible chromatin with high throughput sequencing (ATAC–seq) and RNA sequencing. Deletion of the boundary disrupts mega-domain formation, and induction of Xist RNA initiates formation of the boundary and the loss of DNA accessibility. We also show that in NPCs, the Xi chromosome lacks active/inactive compartments and topologically associating domains (TADs), except around genes that escape XCI. Escapee gene clusters display TAD-like structures and retain DNA accessibility at promoter-proximal and CTCF-binding sites. Furthermore, altered patterns of facultative escape genes in different neural progenitor clones are associated with the presence of different TAD-like structures after XCI. These findings suggest a key role for transcription and CTCF in the formation of TADs in the context of the Xi chromosome in neural progenitors.

Enhanced chromatin accessibility contributes to X chromosome dosage compensation in mammals

Article Open access 01 November 2021

Integrated analysis of Xist upregulation and X-chromosome inactivation with single-cell and single-allele resolution

Article Open access 15 June 2021

Dynamics of gene silencing during X inactivation using allele-specific RNA-seq

Article Open access 03 August 2015

Main

To investigate the structure of the Xi chromosome, we performed allele-specific Hi-C in a clonal neural progenitor cell (NPC) line that was derived from highly polymorphic F₁ mouse embryonic stem (ES) cells (129 × Cast, Extended Data Fig. 1a–f). We first performed Hi-C in ES cells, in which XCI has not yet occurred, and found that autosomes and both active X chromosomes displayed prominent active/inactive (A/B) compartmentalization and TAD structures (Extended Data Figs 2a–c and 3). In NPCs, compartments and TADs were similarly detected on autosomes and the active X (Xa) chromosome (Fig. 1a, Extended Data Fig. 3a). Notably, however, the Xi chromosome displayed no A/B compartments (Extended Data Fig. 2c), but was instead partitioned into two massive interaction domains separated by a hinge region of ~200 kb including the DXZ4 macrosatellite^11,12 (Fig. 1a), as recently reported for the human and mouse Xi chromosomes^7,8,9. Furthermore, TADs were found to be largely absent on the Xi chromosome (Fig. 1a), as previously suggested^9,10.

**Figure 1: The distinct conformation of the Xi and Xa chromosomes.**

To investigate the spatial organization of mega-domains at the single-cell level, we performed DNA fluorescence in situ hybridization (FISH) with 18-Mb probe sets located within one mega-domain (probes a–b), or spanning the mega-domain boundary (probes b–c) (Fig. 1b, Extended Data Fig. 4a). Despite extensive cell-to-cell variation, regions within the same mega-domain showed greater overlap on the Xi than on the Xa chromosome (Fig. 1b, c, Extended Data Fig. 4b), whereas regions spanning the boundary showed lower overlap on the Xi chromosome, in agreement with Hi-C data (Fig. 1c). Similar results were obtained in an NPC clone with a Cast Xi chromosome and in astrocytes (Extended Data Fig. 4c). Using two independent quantification methods (Extended Data Fig. 4d–f), the volume of each 18-Mb region was found to be modestly (approximately 20%) but significantly smaller on the Xi than the Xa chromosome, consistent with observations on the human Xi chromosome^13,14,15. Thus, the mouse Xi chromosome is moderately compacted and partitioned into two large, spatially distinct domains that show varying degrees of overlap within the cell population (Fig. 1b, c, Extended Data Fig. 4).

Although the Xi chromosome globally presented no TADs in NPCs, we could detect a few residual TAD-like structures (Fig. 1a, black arrow). Integration of Hi-C, RNA sequencing (RNA-seq)¹⁶ and ATAC–seq data produced in the same lines revealed that these Xi chromosome TAD-like structures correspond to hotspots of residual transcription and open chromatin. The RNA-seq and ATAC–seq profiles are consistent with global inactivity of the Xi chromosome (Extended Data Fig. 5a), with a 75% reduction in both the number of expressed genes and accessible elements compared to the Xa chromosome. Most ATAC–seq peaks on the Xi chromosome fall in the pseudoautosomal region, the Xist locus and at the promoters of genes that escape XCI either facultatively or constitutively³ (Extended Data Fig. 5a). The amount of local structure on the Xi chromosome correlates with the density of transcribed loci and accessible elements (Extended Data Fig. 5d), as shown in Fig. 2a by three examples: a cluster of 19 facultative escapees including Mecp2, overlapping a strong ~800-kb TAD-like structure; the Xist locus with moderate interactions across a ~250-kb region; and 5 escapees including the constitutive Jarid1c (also known as Kdm5c) and the facultative Huwe1 genes, embedded in a ~500-kb TAD. Importantly, residual TAD-like structures on the Xi chromosome occasionally coincide with sub-TAD structures on the Xa chromosome (Fig. 2a). Generally, escapees are located in Xi chromosome regions with higher TAD strength and chromatin accessibility as compared to silenced genes (Fig. 2b, c, Extended Data Fig. 5b). Thus, in NPCs, the appearance of TAD structures is intimately linked to gene expression on the Xi chromosome, unlike on the Xa chromosome and autosomes, where TADs are present even in the absence of transcription.

**Figure 2: Expression, chromatin accessibility and chromatin conformation along the Xi chromosome.**

Notably, 51% of Xi chromosome accessible sites are <5 kb from a promoter compared to ~35% on the Xa chromosome (Fig. 2d), suggesting that escape is often regulated through promoter-proximal sites. Most ATAC–seq peaks on the Xi chromosome were found at CTCF-binding sites (Extended Data Fig. 5c), implicating CTCF in escape. Recent findings suggest that cohesin (which co-localizes with CTCF) is globally lost on the Xi chromosome, which may lead to the chromosome-wide loss of TADs⁹. Our discovery that only escapees show TAD-like structures on the Xi chromosome, and that they are associated with putative CTCF sites, is consistent with a role for CTCF in TAD formation and/or maintenance^17,18,19.

Escapees on the Xi chromosome also tend to interact with each other even across the mega-domain boundary, consistent with previous circularized chromosome conformation capture (4C) analysis results⁵ (Fig. 2e and Extended Data Fig. 2d, e).

To investigate the importance of the unusual bipartite organization of the Xi chromosome we deleted the ~200 kb boundary region encompassing the DXZ4 locus, specifically on the 129 allele in ES cells (ΔFT) (Extended Data Figs 1b and 6a). After differentiation, many NPC clones were isolated with a 129 (ΔFT) Xi chromosome. Deletion of the boundary did not affect XCI establishment, as NPC clones with either a wild-type or ΔFT Xi chromosome were obtained. Hi-C performed on one such clone (D9B2) revealed massive reorganization of the ΔFT Xi chromosome resulting in fusion of the two mega-domains (Fig. 3a). No effect was visible on the Cast Xa chromosome (Extended Data Fig. 7a). DNA FISH confirmed that sequences on either side of the deleted boundary overlap significantly more on the ΔFT Xi chromosome compared to wild type, consistent with Hi-C data (Extended Data Fig. 8a).

**Figure 3: Deletion of the mega-domain boundary leads to loss of bipartite folding.**

Notably, in the D9B2 clone, facultative escapees (Mecp2, Huwe1) no longer escaped from XCI on the ΔFT Xi chromosome (Extended Data Fig. 6b-c). ATAC–seq and RNA-seq confirmed substantially reduced escape on the ΔFT Xi chromosome, with only 29 expressed genes (Extended Data Fig. 8b). Transcription and chromatin accessibility were lost at 66 of the 87 facultative escape genes (~76%) (Fig. 3b), but maintained at all 6 constitutive escapees (Fig. 3b, Supplementary Table 1 and Extended Data Fig. 8c). However, analysis of multiple NPC clones, both wild type and ΔFT, revealed only a slight tendency for ΔFT NPC clones to show less escape than wild type, with varying degrees of facultative escape between clones, with or without the mega-domain boundary (Extended Data Fig. 6d). Nevertheless, clone D9B2 provided a unique opportunity to study the relationship between transcriptional activity and chromosome conformation on the Xi chromosome.

TAD-like structures were absent on the Xi chromosome when expression was lost at facultative escapees in the D9B2 clone (Fig. 3b and Extended Data Fig. 7b). Xi chromosome-wide comparisons between the D9B2 (ΔFT) and wild-type NPCs revealed a strong correlation between loss of escape, loss of chromatin accessibility and reduction in TAD signal (Fig. 3c). Specific long-range interactions between escapees were also lost on the ΔFT Xi chromosome (Fig. 3d). Eight genes showed de novo escape in the D9B2 clone (Supplementary Table 1). These de novo escapees were not clustered or highly accessible by ATAC–seq, and their expression did not lead to strong TAD formation (data not shown).

Of the 224 Xi chromosome ATAC–seq peaks that we detected in the wild-type clone, 139 were lost in the D9B2 clone (Fig. 3e, Extended Data Fig. 8b). These lost sites were enriched for promoter-proximal location (64%) (Fig. 3f, g). In total, 93% of these promoter-proximal sites contain CTCF-binding sites, an enrichment compared to the 64% of promoter-distal sites (Fig. 3f). These CTCF sites are closer to escape gene transcription start sites (TSSs) than unchanging sites, again pointing to a role for promoter-proximal CTCF in escape gene regulation (Fig. 3h).

We next investigated the role of Xist in establishing the unusual organization of the Xi chromosome. We induced Xist expression in undifferentiated male (XY) ES cells carrying a tetracycline-inducible promoter at the endogenous Xist locus¹. Hi-C revealed that 48 h of Xist induction resulted in notable structural changes along the X chromosome, and these changes were not observed after induction of a Xist mutant lacking the A-repeat region (Fig. 4a, b)—which cannot silence genes (Extended Data Fig. 9a), but is competent for Xist coating and exclusion of RNA polymerase II (refs 1, 4). Wild-type Xist induction did not lead to detectable changes in TAD structure (data not shown), but resulted in increased interaction frequencies along the chromosome (Fig. 4b, d). The contact map of the wild-type Xist-coated X chromosome was found to be more similar to that of the NPC Xi chromosome than either the non-induced or the A-repeat mutant Xist-coated X chromosome (Fig. 4a). Notably, physical separation across the mega-domain boundary occurred (Fig. 4b), confirmed by RNA/DNA FISH (Fig. 4c, Extended Data Fig. 9b). Induction of wild-type Xist from one X chromosome in female ES cells²⁰ generated a boundary of comparable magnitude (Extended Data Fig. 9c). ATAC–seq showed that 48 h of wild-type (but not A-repeat mutant) Xist expression results in globally reduced (but not eliminated) accessibility on the X chromosome (Fig. 4b, e). Interestingly, a small number of loci showed increased chromatin accessibility after wild-type Xist coating including the Firre long noncoding RNA (Fig. 4b), which together with the DXZ4 macrosatellite has been proposed to anchor the Xi chromosome to the nucleolus²¹. Regions of reduced chromatin accessibility tend to show increased interaction frequencies with loci in the surrounding ~20 Mb of genomic sequence (Extended Data Fig. 9d). We note that conformational changes on the Xist-coated X chromosome in male ES cells were relatively mild compared to the NPC Xi chromosome, probably for several reasons. First, Xist was induced in ~35–45% of cells (in two independent replicates). Detecting structural alterations in such a sub-population may be difficult by Hi-C, and this may explain our observation that TADs are still detected in the overall cell population. Second, the Xist-coated Xi chromosome chromatin state in ES cells may not be comparable to that in differentiated NPCs²².

**Figure 4: *Xist*-mediated silencing is sufficient to generate a boundary at *DXZ4* in ES cells.**

In conclusion, our study uncovers a complex molecular architecture of the Xi chromosome, and reveals new insights into TAD formation. The Xi chromosome is moderately compacted and organized into two mega-domains, with global loss of TADs, except at clusters of expressed genes. Thus, in contrast with the notion that TADs are highly stable across differentiation and do not require transcription for their presence or maintenance^10,23, our findings demonstrate that (1) TADs can be lost in the context of the Xi chromosome, at least in NPCs; and (2) transcription together with binding of factors such as CTCF may enable their maintenance or de novo re-creation.

The bipartite folding of the Xi chromosome into two mega-domains is evolutionarily conserved, pointing to a possible role for this peculiar organization in the XCI process. Deletion of the boundary region, leading to fusion of the two mega-domains, does not affect initiation of XCI. However, somewhat reduced rates of facultative, but not constitutive, escape are observed. Facultative escapees are silenced during XCI and then re-expressed^24,25 and may be more prone to stochastic events and/or the influence of long-range interactions with other parts of the X chromosome, such as the CTCF-rich DXZ4 region²⁶. As promoter-proximal CTCF sites characterize both facultative and constitutive escape genes, we speculate that transient interactions of escapees with the boundary region during XCI may influence escape (Extended Data Fig. 9e). However, the highly variable facultative escape that we found between NPC clones, both with or without the DXZ4 boundary, suggests that escape efficiency cannot be due just to the presence of the DXZ4 region, but may be influenced by local folding and accessibility to CTCF binding. The exact timing and mechanism of facultative escape, and its relationship with TAD formation, will require further investigation. In conclusion, our study establishes the Xi chromosome as a powerful system for studying the mechanistic relationships between chromosome conformation and gene regulation, and points to key roles for gene activity and CTCF in the establishment of TAD structure in the context of facultative heterochromatin.

Note added in proof: Consistent with our observations in mouse, deletion of DXZ4 from the human Xi chromosome results in loss of mega-domain structure (E. Lieberman Aiden & B. Chadwick, Deletion of the macrosatellite DXZ4 on the human inactive X chromosome alters higher-order genome architecture. Proc. Natl Acad. Sci USA (in press)).

Methods

No statistical methods were used to predetermine sample size. The experiments were not randomized, and investigators were not blinded to allocation during experiments and outcome assessment.

Cell culture

The hybrid mouse ES cell line F121.6 (129Sv-Cast/EiJ), a gift from J. Gribnau, was grown on mitomycin-C-inactivated mouse embryonic fibroblasts in ES cell media containing 15% FBS (Gibco), 10-⁴M β-mercaptoethanol (Sigma), 1,000 U ml⁻¹ leukaemia inhibitory factor (LIF, Chemicon). Male-inducible TXY and TXY:ΔA lines (a gift from A. Wutz, called Xist-tetOP and Xist-ΔSX-tetOP, respectively, in ref. 1) were cultured in the same conditions and treated for 48 h with 2 μg ml⁻¹ doxycycline. Differentiation of F121.6 ES cells into NPCs was performed as previously described¹⁶. Subcloning of NPCs was made by limiting dilution and manual colony picking. All cells used in this study were characterized for absence of mycoplasma contamination.

Boundary deletion

To generate the boundary region deletion, 5 × 10⁶ ES cells were transfected with 5 μg each of two plasmids (pX459) each expressing Cas9 and a chimaeric guide RNA (gRNA1: CATGTTTGAGCATGGAAACCCGG, chrX:72823838–72823860; gRNA2: GGGTTATGGCGGTCGGTTCCTGG, chrX:73025513–73025535). Subcloning of ES cells was made by limiting dilution. Cells were treated for 24 h with puromycin. As soon as visible, single colonies were picked under a microscope to be screened for deletion by PCR (forward primer: 5′-CGTAGACGCGGCAGTAGTTT-3′, reverse primer: 5′-ACATAAACTCCTTTTCAGGACCA-3′). To identify the targeted allele, we performed a PCR using primers (forward: 5′-CTGTCCAAATGGAGGTGCTT-3′, reverse: 5′-CCTAGGTCCGCTCTCTATCG-3′) that amplify a 203-bp amplicon specifically on the wild-type allele, which contains a single nucleotide polymorphism (SNP; rs29035891). After amplification, PCR products were gel-purified and sequenced using the forward or reverse primer used for PCR. Clones positive carrying the deletion were expanded and differentiated into NPC as previously described¹⁶ and subcloned by limiting dilution. NPC lines were maintained in N2B27 medium supplemented with EGF and FGF (10 ng ml⁻¹ each), on 0.1% gelatin-coated flasks. Clones carrying the boundary deletion on the inactive X were identified by RNA FISH against Xist with the p510 plasmid probe and DNA FISH with a BAC hybridizing inside the deleted region (RP23-299L1).

RT–PCR and pyrosequencing

RNA extraction and cDNA synthesis were performed with the Cells-to-Ct kit (Ambion) following the manufacturer’s instructions. Pyrosequencing primers were designed using the PyroMark Assay Design software. PCR products were purified and annealed with sequencing primers for pyrosequencing using the PyroMark q24 (Qiagen) (Xist: forward primer 5′-AGAGAGCC CAAAGGGACAAA-3′, reverse primer 5′-TGTATAGGCTGCTGGCAGTCC-3′, sequencing primer 5′-GCTGGCAGTCCTTGA-3′; Mecp2: forward primer 5′-CTGAAGGTTGTAGTGGCTCATG-3′, reverse primer 5′-ATGGTAGCTGG GATGTTAGGG-3′, sequencing primer 5′-CAGAGACAAGCCACTGA-3′; Huwe1:forward primer 5′-GCAGGTGTCTGCAAATCCA-3′, reverse primer 5′-GCCGATGTAAAGGCTCCAAG-3′, sequencing primer 5′-TGGGTTCATGT GACAG-3′; Jarid1c: forward primer 5′-GCTGCCTCCTTTGCCTGAAAT-3′, reverse primer 5′-TTCAGGGGGCCGCTTACA-3′, sequencing primer 5′-CTCCTTTGCCTGAAAT-3′).

Hi-C read mapping, binning, ICE correction

Hi-C was performed as previously described^27,28. To obtain allele-specific Hi-C interaction maps in female ES cells (Xa^castXa¹²⁹) and a derived clonal NPC line (Xa^castXi¹²⁹) (Methods; Extended Data Fig. 1)¹⁶, we first constructed an allelic genome using the reference mm9 genome and all 19,722,473 SNPs. The allelic (Cast and 129) genomes were then combined to create a reference diploid genome (consisting of 44 chromosomes; 1–19 X,Y,M). All reads were aligned to the diploid genome (as described in ref. 29), thus allowing for a competitive mapping strategy between the two alleles. All reads were trimmed to 50 bp and then aligned using the novoCraft novoalign (v.3.02.00) software package. Reads were aligned using the following options (-r all 5 -R 30 -q 2 -n 50, minimumReadDistance = 5). The best alignment was selected from the list of the top 5 alignments. The alignment was considered unique (allelic), if its alignment score was ≥5 from the second best alignment score (alignment score taken from the ZQ tag). Reads that aligned uniquely to an allele were classified as allelic (either Cast or 129) whereas reads that aligned to both alleles equally (≤5 distance) were classified as ambiguous (AMB) (Extended Data Fig. 1d). Uniquely aligned Hi-C interactions between loci located on the same chromosome were assigned to a specific parental chromosome in cis when at least one of the two reads contained a diagnostic SNP, and the other either contained a SNP from the same allele, or mapped to both alleles³⁰. We obtained the following paired-end read counts: For ES cells (GUR.2d), a total of 401,684,614 interactions could be aligned combining the two replicates, 372,272,389 of which were unique (after PCR duplicate filter), and 95,650,438 of which could be placed to either the Cast or 129 allele (25.69%). For NPCs (GEI.72b), a total of 277,440,656 interactions could be aligned, 253,254,798 of which were unique (after PCR duplicate filter), and 82,323,031 of which could be placed to either the Cast or 129 allele (32.51%). For ΔFT NPCs (D9B2/B129T3), a total of 229,331,123 interactions could be aligned, 222,941,525 of which were unique (after PCR duplicate filter), and 85,331,870 of which could be placed to either the Cast or 129 allele (38.28%). The difference in percentage of reads assignable to either allele is probably due to differences in the percentage of cis interactions found in each sample (biological or technical variation). The 82–95 million read depth supported generation of allele-specific chromatin interaction maps at multiple resolutions (10 Mb, 2.5 Mb, 1 Mb, 500 kb, 250 kb, 100 kb and 40 kb).

Biological replicates were highly correlated. Pearson’s correlation coefficients for 500 kb data on chrX were as follows: EHSNP-mF1216__R1R2__chrX-129S1, 0.992331; EHSNP-mF1216__R1R2__chrX-cast, 0.990373; EHSNP-mNPe-deltaRF__R1R2__chrX-129S1, 0.976562; EHSNP-mNPe-deltaRF__R1R2__chrX-cast, 0.983614; EHSNP-mNPe__R1R2__chrX-129S1, 0.990976; EHSNP-mNPe__R1R2__chrX-cast, 0.995202. Autosomes showed similar correlation values. Overall these numbers indicate that the produced Hi-C data was of high quality and well correlated between biological replicates. We pooled all biological replicates into a single Hi-C data set per sample and subsequently used the pooled data for all analyses.

Iterative mapping and error filtering/iterative correction of the chromatin interaction data were performed as previously described^29,31. Iterative correction was performed on the diploid (44 chromosomes) (replicate pooled) genome-wide matrix for all resolutions.

Hi-C for the TXY (male) samples was performed as previously described^27,28. Reads originating from the TXY (male) sample was aligned to the mm9 reference genome. Iterative mapping and error filtering/iterative correction of the chromatin interaction data were performed as previously described^29,31. Iterative correction was performed genome-wide (22 chromosomes) (replicate pooled) on the genome-wide matrix for all resolutions. Biological replicates were highly correlated. We pooled all biological replicates into a single Hi-C data set per sample and subsequently used the pooled data for all analyses.

We obtained the following paired-end read counts: For TXY +dox, a total of 277,191,448 interactions could be aligned, 267,007,192 of which were unique (after PCR duplicate filter). For TXY WT −dox, a total of 308,671,996 interactions could be aligned, 300,102,244 of which were unique (after PCR duplicate filter). For TXY:ΔA +dox, a total of 281,116,218 interactions could be aligned, 273,612,976 of which were unique (after PCR duplicate filter). For TXY:ΔA −dox, a total of 298,436,664 interactions could be aligned, 289,376,893 of which were unique (after PCR duplicate filter).

Allele-specific read mapping validation (Hi-C and ATAC–seq)

To validate the accuracy of the allele-specific read alignment strategies used in this paper, we first constructed a set of validation reads tiled across all SNP locations between the Cast and 129 genomes. In brief, for each SNP location on the X chromosome, all overlapping 50-bp reads were extracted (50 total) for each of the 129 and Cast alleles (Extended Data Fig. 1i). All reads were then processed through the Hi-C and ATAC–seq mapping pipelines described in the methods to measure assignment accuracy. Encoded into each fastq readID, was the allelic genome that each read originated from, the relative-position (within the 50-bp read) of the SNP, the base-call of the SNP (A, C, T, G), the chromosome and position (start, end) of the read. The set of validation reads were then processed through the ATAC–seq allele-specific and Hi-C allele-specific pipelines with no modifications. Each processed read was then scored according to whether or not it was correctly placed to not only the correct chromosomal location, but to the correct allele as well. For the Hi-C allele-specific pipeline we found 0 reads assigned to the incorrect chromosomal location and 0 reads assigned to the incorrect allele. All processed validation reads were assigned to the correct coordinate and allele. Reads that span repetitive regions or are of low complexity were inherently filtered via the Hi-C pipeline and thus would be excluded from both the actual data and from the set of validation reads. For the ATAC–seq allele-specific pipeline, we found that for reads coming from the 129 X chromosome, only 0.09% mapped to the Cast chromosome (only 3 of these fell within ATAC–seq peaks). For reads coming from the Cast X chromosome, only 0.21% mapped to the 129 chromosome (5 of which fell within ATAC–seq peaks). Furthermore, we feel that our analysis may be even more accurate when using paired-end reads as we do for all ATAC–seq data analysis (Extended Data Fig. 1i).

Generation of Xist-positive Hi-C signal for comparison with DNA FISH

Xist RNA FISH performed in parallel with Hi-C on the same inducible Xist ES cell samples revealed that Xist expression (either wild-type or A-repeat mutant) was induced in ~35–45% of dox-treated cells. Hence the Hi-C signal can be represented as the sum of ~35–45% of reads coming from Xist-positive X chromosomes, and ~55–65% of reads that are generated from non-coated X chromosomes. For example,

We took advantage of the fact that the signal from Xist− cells is measured independently in the non-induced (−dox) sample:

to extract the signal of Xist-positive cells:

and hence

Negative values were assigned to ‘not analysed’ (NAs). The Hi-C(Xist+) signal was then used to compare Hi-C data with RNA/DNA FISH experiments in Fig. 4b, which allow to discriminate Xist-coated and non-coated chromosomes visually by the presence of an Xist cloud. In Fig. 4c, Xist− signals correspond to the −dox sample.

Hi-C SNP density filter

To remove potential biases in the Hi-C data related to the density of SNPs in each bin, we calculated the number of SNPs residing in each genomic interval (bin) for all Hi-C bins across all bin sizes. We then calculated the median number of SNPs per bin, and produced a minimum required SNP density cutoff defined as the (median − 1.5 × IQR). Any bins with less SNPs than the cutoff were removed from all analyses. The SNP density cutoffs used for each bin size were: 40 kb, 43 SNPs; 100 kb, 216 SNPs; 250 kb, 776.5 SNPs; 500 kb, 1,767.25 SNPs. The non-SNP-density-filtered data was only used for visualization purposes (figure heat maps). Throughout the manuscript, we refer to Hi-C as data that has been iteratively corrected³¹ and run through the SNP-density filter.

Compartment analysis

The presence and location of the A/B compartments were calculated as previously described³². Compartments were derived from the 250-kb iteratively corrected Hi-C data for each chromosome separately using the CIS maps for each sample/allele. The code used to generate the compartments (PC1 from PCA analysis) is publicly available on Github (matrix2compartment.pl): https://github.com/dekkerlab/giorgetti-nature-2016. Compartments were generated all default options except the (cis alpha) option, set to (-ca 0.005).

Insulation and boundary calculation

TAD structure (insulation/boundaries) was defined via the insulation method as previously described with minor modifications³². The code used to calculate the insulation score is publicly available on Github (matrix2insulation.pl): https://github.com/dekkerlab/giorgetti-nature-2016. Insulation vectors were detected using the following options: (-is 480000 -ids 320000 -im iqrMean -nt 0 -ss 160000 -yb 1.5 -nt 0 -bmoe 0). The output of the insulation script is a vector of insulation scores, and a list of minima along the insulation vector (inferred as TAD boundaries). The TAD boundaries were not used in this study.

Interaction pile-up maps

Interaction pile-up maps were constructed from all pairwise interactions between either the list of 87 wild-type NPC Xi chromosome escapees or the 29 ΔFT NPC Xi chromosome escapees. Using the 40 kb Hi-C data, a 2-Mb window centred around each pairwise interaction (pixel) was taken (25 bins in each direction, yielding 51 × 51 sub-matrix). Any resulting sub-megabase that overlapped the (y = x) diagonal in the matrix was excluded from the analysis (effectively excluding all interactions <2 Mb). All sub-matrices were then averaged to produce the final (mean) pile-up map. A strong signal at the centre suggests that the elements used tend to contact one another in 3D space.

The Xi chromosome is as accessible and detectable in Hi-C as the Xa chromosome and autosomes

The number of RAW reads observed for both the Xa and Xi chromosomes were very similar for all chromosomes, thus demonstrating that the Xi chromosome is not simply less accessible/visible to the Hi-C methodology. ES-cell-chrX-129S1, 1,118,327; ES-cell-chrX-Cast, 1,104,709; NPC-chrX-129S1, 1,147,072; NPC-chrX-Cast, 1,148,128; ΔFTNPC-chrX-129S1, 1,314,476; ΔFTNPC-chrX-Cast, 1,288,802. Bias in read directional due to partial digestion is typically observed up to ~10 kb. For interactions between fragments separated by over 10 kb this bias is negligible, indicating at least one digestion occurring between them in every cell. This genomic distance is therefore a measure for digestion efficiency²⁷. For both the Xa and Xi chromosomes, this genomic distance is ~6–10 kb, indicating that digestion efficiency of chromatin on the Xa and Xi chromosomes are comparable. Thus, the unique conformation of the Xi chromosome does not affect Hi-C analysis, as was also found for condensed mitotic chromosomes²⁸.

Correlation analysis of Hi-C matrices

We compared X chromosome interaction matrices of Xist-inducible lines (pre/post-Xist induction in wild-type and A-repeat mutant samples) with that of the NPC Xi chromosome, at 500-kb resolution. As distance-dependent decay of interaction frequency causes all interaction matrices to be highly correlated, we first compensated for this effect by multiplying the read count in each bin by its respective genomic distance. We then calculated the Spearman correlation between each pair of matrices. After wild-type Xist induction in TXY cells, the interaction map becomes more similar to that of the NPC Xi chromosome (rho = 0.17 to 0.31), while no change is observed in the repeat-A mutant (rho = 0.17 to 0.17).

RNA and 3D-DNA FISH

FISH was performed as previously described³³. ES cells and NPCs were cultured on gelatin-coated coverslips #1.5 (1 mm) and fixed in 3% paraformaldehyde for 10 min at room temperature. Cells were permeabilized on ice for 5 min in 1× PBS, 0.5% Triton X-100 and 2 mM vanadyl-ribonucleoside complex (VRC, New England Biolabs), and coverslips were stored in 70% ethanol at −20 °C. Before FISH, samples were dehydrated through an ethanol series (80%, 95%, 100% twice) and air-dried briefly. For RNA FISH, cells were directly hybridized with denatured probes. For DNA FISH, samples were first denatured in 50% formamide/2× SSC (pH = 7.3) at 80 °C for 37 (ES cell) and 35 (NPC) min, immediately placed on ice and washed twice with ice-cold 2× SSC. After overnight hybridization at 37 °C for RNA FISH or 42 °C for DNA FISH, coverslips were washed at 42 °C for RNA or 45 °C for DNA, three times for 5 min in 50% formamide/2× SSC at pH 7.3, and three times for 5 min in 2× SSC. Nuclei were counterstained with 0.2 mg ml⁻¹ DAPI (2 mg ml⁻¹ for structured illumination microscopy), further washed twice for 5 min in 2× SSC at room temperature and finally mounted with 90% glycerol, 0.1× PBS, 0.1% p-phenylenediamine at pH9 (Sigma).

RNA FISH probes

We used the p510 plasmid coupled with Cy5 to detect Xist. For RNA FISH on escape genes, we used the following BAC and fosmid probes: RP23-436K3, RP23-328M22, RP24-436K3, WI1-1269O10 (Mecp2), RP24-157H12 (Huwe1), RP23-13D21 (G6pdx), RP24-148H21 (Jarid1c).

DNA FISH probes

In experiments to detect the mega-domain boundary, fluorescent oligonucleotides (average length 45 bp, 5′-modified with Atto 448 or Atto 550, average density: one oligonucleotide every 3 kb) were obtained from MYcroarray Inc. Oligonucleotides were designed to tile the following consecutive 18-Mb regions: chrX:35,000,000–53,000,000, chrX:53,000,000– 72,000,000, and chrX:72,000,000–90,000,000. To detect the DXZ4 region we used the RP23-299L1 BAC.

Imaging and quantification of 3D DNA FISH

Three-dimensional image stacks (200 nm distance between consecutive xy planes) were acquired on a DeltaVision Core wide-field microscope (Applied Precision) equipped with a CoolSNAP HQ2 camera operated at 2X binning, and a 100× PlanApo oil immersion objective (the effective pixel size was 129 × 129 nm). Xi chromosome signals were identified via the presence of an Xist mRNA cloud in the far-red channel (p510-Cy5 probe). Pearson correlation between red and green signals was calculated using custom-made ImageJ macros as follows. After subtracting the background from each xy plane (generated by morpholoigcal opening the image with a circle of 5 pixels in radius), Pearson correlation between red and green pixel intensities was measured inside a fixed-size region of 40 × 40 × 20 pixels (5.16 × 5.16 × 4 μm³) centred on each FISH signal. The significance of Xi versus Xa chromosome differences in correlation was assessed by Wilcoxon’s rank sum test. Random nuclear positions were used to estimate the background correlation that could be observed due to non-specific probe hybridization.

The gyration tensor of a greyscale image is defined as

where k is an index running over voxels, is the greyscale intensity of voxel k, and and are the a-th components (x, y, or z) of the xyz position of voxel k, and of the centre of mass of the image, respectively. The gyration tensor was valuated in a region of interest of 3.8 × 3.8 × 4 μm³ centred on each FISH signal and the gyration radius was calculated as

where λ_1,2,3 are the eignevalues of S_ab.

RNA-seq

RNA-seq data for the ES cell (GUR.2d) and NPC (GEI.72b) was obtained from previously published work (PMID 24576422)¹⁶. RNA-seq data for the mutant NPC (D9B2/B129T3) was obtained and processed as previously described¹⁶.

RNA-seq ‘expressed/escapee’ classification

The allelic RPKM values were derived for each gene by splitting the RPKM value by the 129 ratio. 129 RPKM = (RPKM × 129 ratio); Cast RPKM = (RPKM × (1 − 129 ratio)). Any gene with an allelic RPKM value ≥3 RPMK was classified as being expressed. Any gene expressed on the Xi chromosome was classified as being an escapee.

ATAC–seq

ATAC–seq library preparation was performed exactly as previously described³⁴. Sequencing was carried out on an Illumina NextSeq 500 generating 2 × 75 bp paired-end reads. Libraries were sequenced to a depth of 25–35 million reads per sample. Reads were trimmed using CutAdapt and aligned using Bowtie2. Reads were aligned to a custom 129/CastEiJ genome in which SNP sites were replaced by ‘N’. Approximately 52–58% of reads per line contained ‘N’s and were assigned to the 129 or Cast allele based on the identity of the base at that location. Reads containing non-concordant SNPs were rare and were discarded. Reads not containing SNP sites were included in overall peaks but not were excluded from allele-specific tracks. ATAC–seq peaks were called using MACS2 with no shifting model. For TXY Xist-inducible male cell lines, which contain only one X chromosome, peaks were called after normalizing all samples by read count on the autosomes. The set of X chromosome peaks was defined as the merge of peaks from all samples, and bedtools was used to calculate coverage within peaks.

Assigning allele-specific ATAC–seq peaks

For each ATAC–seq peak, all N-containing reads were counted and assigned to 129 or Cast alleles based on SNP at the N-containing position. To assign monoallelic and biallelically accessible peaks, a d-score was calculated as a measure of allelic imbalance³⁵. In brief, for a given peak the d-score was calculated as the ratio of 129 reads to total number of reads − 1/2. A peak with a d-score ≥ 0.3 was assigned as a 129-specific peak. A peak with a d-score ≤ −0.3 was assigned as a Cast-specific peak. Any peak with a d-score > −0.3 was assigned as a peak in 129 (monoallelic or biallelic). Any peak with a d-score < +0.3 was assigned as a peak in Cast (monoallelic or biallelic).

Annotating ATAC–seq peaks using ChIP–seq data

ATAC–seq peaks were annotated using existing published ChIP–seq data sets. CTCF ChIP–seq came from whole female mouse brain³⁶. Called CTCF binding sites were used and extended ±300 bp before overlapping with ATAC–seq peaks. H3K27ac and p300 ChIP–seq are from mouse NPCs³⁷. For H3K27ac and p300 ChIP–seq data, peaks were called using MACS2 and then overlapped with ATAC–seq peak locations.

Integrating Hi-C, ATAC–seq and RNA-seq data

Integrative analysis of Hi-C insulation (TAD structure), ATAC–seq counts, and RNA-seq RPKM was performed as follows. A promoter region was defined for each gene as ±500 bp from the TSS. ATAC peaks were assigned to a gene if they overlapped with the promoter region. In the event that >1 ATAC peak overlapped with the promoter, the closer ATAC peak was chosen. An ATAC count of 0 was assigned to each promoter, if it did not contain an ATAC peak. If the ATAC allelic counts overlapping the promoter were <10, then the ATAC count was set to ‘NA’. The 40-kb bin overlapping the promoter region was used to display the insulation and insulation-difference value.

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

Sequencing data have been deposited in the Gene Expression Omnibus (GEO) under accession numbers GSE72697 (Hi-C); GSE71156 (ATAC–seq); and GSE72697 (boundary deletion data).

References

Wutz, A., Rasmussen, T. P. & Jaenisch, R. Chromosomal silencing and localization are mediated by different domains of Xist RNA. Nature Genet. 30, 167–174 (2002)
Article CAS Google Scholar
Chu, C. et al. Systematic discovery of Xist RNA binding proteins. Cell 161, 404–416 (2015)
Article CAS Google Scholar
Peeters, S. B., Cotton, A. M. & Brown, C. J. Variable escape from X-chromosome inactivation: identifying factors that tip the scales towards expression. BioEssays 36, 746–756 (2014)
Article CAS Google Scholar
Chaumeil, J., Le Baccon, P., Wutz, A. & Heard, E. A novel role for Xist RNA in the formation of a repressive nuclear compartment into which genes are recruited when silenced. Genes Dev. 20, 2223–2237 (2006)
Article CAS Google Scholar
Splinter, E. et al. The inactive X chromosome adopts a unique three-dimensional conformation that is dependent on Xist RNA. Genes Dev. 25, 1371–1383 (2011)
Article CAS Google Scholar
Engreitz, J. M. et al. The Xist lncRNA exploits three-dimensional genome architecture to spread across the X chromosome. Science 341, 1237973 (2013)
Article Google Scholar
Rao, S. S. P. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 162, 687–688 (2015)
Article CAS Google Scholar
Deng, X. et al. Bipartite structure of the inactive mouse X chromosome. Genome Biol. 16, 152 (2015)
Article Google Scholar
Minajigi, A. et al. Chromosomes. A comprehensive Xist interactome reveals cohesin repulsion and an RNA-directed chromosome conformation. Science 349, aab2276 (2015)
Article Google Scholar
Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385 (2012)
Article ADS CAS Google Scholar
Giacalone, J., Friedes, J. & Francke, U. A novel GC-rich human macrosatellite VNTR in Xq24 is differentially methylated on active and inactive X chromosomes. Nature Genet. 1, 137–143 (1992)
Article CAS Google Scholar
Chadwick, B. P. DXZ4 chromatin adopts an opposing conformation to that of the surrounding chromosome and acquires a novel inactive X-specific role involving CTCF and antisense transcripts. Genome Res. 18, 1259–1269 (2008)
Article CAS Google Scholar
Teller, K. et al. A top-down analysis of Xa- and Xi-territories reveals differences of higher order structure at ≥ 20 Mb genomic length scales. Nucleus 2, 465–477 (2011)
Article Google Scholar
Naughton, C., Sproul, D., Hamilton, C. & Gilbert, N. Analysis of active and inactive X chromosome architecture reveals the independent organization of 30 nm and large-scale chromatin structures. Mol. Cell 40, 397–409 (2010)
Article CAS Google Scholar
Eils, R. et al. Three-dimensional reconstruction of painted human interphase chromosomes: active and inactive X chromosome territories have similar volumes but differ in shape and surface structure. J. Cell Biol. 135, 1427–1440 (1996)
Article CAS Google Scholar
Gendrel, A.-V. et al. Developmental dynamics and disease potential of random monoallelic gene expression. Dev. Cell 28, 366–380 (2014)
Article CAS Google Scholar
Zuin, J. et al. Cohesin and CTCF differentially affect chromatin architecture and gene expression in human cells. Proc. Natl Acad. Sci. USA 111, 996–1001 (2014)
Article ADS CAS Google Scholar
Sofueva, S. et al. Cohesin-mediated interactions organize chromosomal domain architecture. EMBO J. 32, 3119–3129 (2013)
Article CAS Google Scholar
Vietri Rudan, M. et al. Comparative Hi-C reveals that CTCF underlies evolution of chromosomal domain architecture. Cell Reports 10, 1297–1309 (2015)
Article CAS Google Scholar
Schulz, E. G. et al. The two active X chromosomes in female ESCs block exit from the pluripotent state by modulating the ESC signaling network. Cell Stem Cell 14, 203–216 (2014)
Article CAS Google Scholar
Yang, F. et al. The lncRNA Firre anchors the inactive X chromosome to the nucleolus by binding CTCF and maintains H3K27me3 methylation. Genome Biol. 16, 52 (2015)
Article Google Scholar
Wutz, A. & Jaenisch, R. A shift from reversible to irreversible X inactivation is triggered during ES cell differentiation. Mol. Cell 5, 695–705 (2000)
Article CAS Google Scholar
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012)
Article ADS CAS Google Scholar
Corbel, C., Diabangouaya, P., Gendrel, A.-V., Chow, J. C. & Heard, E. Unusual chromatin status and organization of the inactive X chromosome in murine trophoblast giant cells. Development 140, 861–872 (2013)
Article CAS Google Scholar
Patrat, C. et al. Dynamic changes in paternal X-chromosome activity during imprinted X-chromosome inactivation in mice. Proc. Natl Acad. Sci. USA 106, 5198–5203 (2009)
Article ADS CAS Google Scholar
Horakova, A. H. et al. The mouse DXZ4 homolog retains Ctcf binding and proximity to Pls3 despite substantial organizational differences compared to the primate macrosatellite. Genome Biol. 13, R70 (2012)
Article Google Scholar
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009)
Article ADS CAS Google Scholar
Naumova, N. et al. Organization of the mitotic chromosome. Science 342, 948–953 (2013)
Article ADS CAS Google Scholar
Lajoie, B. R., Dekker, J. & Kaplan, N. The Hitchhiker’s guide to Hi-C analysis: practical guidelines. Methods 72, 65–75 (2015)
Article CAS Google Scholar
Selvaraj, S., R Dixon, J., Bansal, V. & Ren, B. Whole-genome haplotype reconstruction using proximity-ligation and shotgun sequencing. Nat. Biotechnol. 31, 1111–1118 (2013)
Article CAS Google Scholar
Imakaev, M. et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods 9, 999–1003 (2012)
Article CAS Google Scholar
Crane, E. et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature 523, 240–244 (2015)
Article ADS CAS Google Scholar
Chaumeil, J., Augui, S., Chow, J. C. & Heard, E. Combined immunofluorescence, RNA fluorescent in situ hybridization, and DNA fluorescent in situ hybridization to study chromatin changes, transcriptional activity, nuclear organization, and X-chromosome inactivation. The Nucleus 463, 297–308 (2008)
Article CAS Google Scholar
Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013)
Article CAS Google Scholar
Eckersley-Maslin, M. A. et al. Random monoallelic gene expression increases upon embryonic stem cell differentiation. Dev. Cell 28, 351–365 (2014)
Article CAS Google Scholar
Berletch, J. B. et al. Escape from X inactivation varies in mouse tissues. PLoS Genet. 11, e1005079 (2015)
Article Google Scholar
Martynoga, B. et al. Epigenomic enhancer annotation reveals a key role for NFIX in neural stem cell quiescence. Genes Dev. 27, 1769–1786 (2013)
Article CAS Google Scholar
McHugh, C. A. et al. The Xist lncRNA interacts directly with SHARP to silence transcription through HDAC3. Nature 521, 232–236 (2015)
Article ADS CAS Google Scholar

Download references

Acknowledgements

We thank members of the Heard, Dekker, and Chang laboratories for their help and critical insights; PICT-IBiSA@BDD (UMR3215/U934) Imaging facility of the Institut Curie. L.G. would like to thank L. Mirny for discussing gyration tensor analysis. Supported by grants from the National Institutes of Health (P50-HG007735) and Scleroderma Research Foundation (to H.Y.C.), from the National Human Genome Research Institute (R01 HG003143) and the National Institutes of Health Common Fund, National Institute of Diabetes and Digestive and Kidney Diseases (U54 DK107980) to J.D., the Human Frontier Science Program to N.K., an EMBO Fellowship to L.G., an ERC Advanced Investigator award (ERC-2014-AdG no. 671027), EU FP7 grants SYBOSS (EU 7th Framework G.A. no. 242129) and MODHEP (EU 7th Framework G.A. no. 259743), La Ligue, Fondation de France, Labex DEEP (ANR-11-LBX-0044) part of the IDEX Idex PSL (ANR-10-IDEX-0001-02 PSL) and ABS4NGS (ANR-11-BINF-0001) to E.H. J.D. is an investigator of the Howard Hughes Medical Institute.

Author information

Luca Giorgetti
Present address: †Present address: Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, CH-4058 Basel, Switzerland.,
Luca Giorgetti, Bryan R. Lajoie, Ava C. Carter and Mikael Attia: These authors contributed equally to this work.

Authors and Affiliations

Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, 26 Rue d’Ulm, Paris, 75248, Cedex 05, France
Luca Giorgetti, Mikael Attia, Chong Jian Chen & Edith Heard
Department of Biochemistry and Molecular Pharmacology, Program in Systems Biology, University of Massachusetts Medical School, 368 Plantation Street, Worcester, 01605, Massachusetts, USA
Bryan R. Lajoie, Ye Zhan, Noam Kaplan & Job Dekker
Center for Personal Dynamic Regulomes and Program in Epithelial Biology, Stanford University School of Medicine, Stanford, 94305, California, USA
Ava C. Carter, Jin Xu & Howard Y. Chang
Collège de France, 11 place Marcelin-Berthelot, Paris, 75005, France
Edith Heard
Howard Hughes Medical Institute, University of Massachusetts Medical School, 368 Plantation Street, Worcester, 01605, Massachusetts, USA
Job Dekker

Authors

Luca Giorgetti
View author publications
You can also search for this author in PubMed Google Scholar
Bryan R. Lajoie
View author publications
You can also search for this author in PubMed Google Scholar
Ava C. Carter
View author publications
You can also search for this author in PubMed Google Scholar
Mikael Attia
View author publications
You can also search for this author in PubMed Google Scholar
Ye Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Jin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chong Jian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Noam Kaplan
View author publications
You can also search for this author in PubMed Google Scholar
Howard Y. Chang
View author publications
You can also search for this author in PubMed Google Scholar
Edith Heard
View author publications
You can also search for this author in PubMed Google Scholar
Job Dekker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.H. and J.D. conceived the original strategy. For Hi-C, M.A. and L.G. prepared the ES cell and NPC samples, Y.Z. performed the Hi-C experiments; for ATAC–seq, H.Y.C., E.H. and J.D. designed the experiments; A.C.C. and J.X. prepared the samples and performed the experiments; for RNA-seq, M.A. and L.G. prepared the samples and performed the experiments and C.J.C. analysed the data. Integrated analysis of Hi-C sequencing, RNA-seq and ATAC–seq data was performed by B.R.L., J.X., L.G. and A.C.C., with assistance from C.J.C. and N.K. and input from J.D.; L.G. and E.H. designed the FISH experiments, L.G. and M.A. performed FISH experiment and L.G. analysed data. M.A., L.G. and E.H. designed the NPC and DXZ4 mutant strategy, M.A. and L.G. performed the experiments and analysed them. L.G., B.R.L., A.C.C., E.H. and J.D. wrote the manuscript with input from H.Y.C.

Corresponding authors

Correspondence to Edith Heard or Job Dekker.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Extended data figures and tables

Extended Data Figure 1 Description of allele-specific Hi-C and ATAC–seq.

a, Schematic of hybrid mouse strains used for all experiments. b, Top, scheme outlining differentiation of ES cells to NPCs and picking of clones. Bottom, scheme outlining CRISPR deletion of the mega-domain boundary in ES cells, differentiation to NPCs and the picking of clones. c, Schematic of Hi-C library generation. d, Schematic of the Hi-C alignment strategy. Paired-end reads are aligned to a ‘diploid’ genome consisting of 22 chromosomes from Cast, and 22 chromosomes from 129 (1–19 X, Y, M). The interaction row shows all possible paired-end read combinations between the 129, Cast and ambiguous (AMB) genomes. e, Schematic showing the re-assignment of a subset of ‘cis’ interactions. Paired-end reads in which one side uniquely aligned to an allele and the other side aligned equally to both alleles (AMB), were re-classified as allelic reads, only if both reads aligned to the same chromosome (cis). f, Cartoon explaining the re-assignment of 129:amb or cast:amb cis interactions. g, Scheme for ATAC–seq library preparation. Cells are lysed followed by incubation with adaptor-loaded hyperactive Tn5 transposase. The transposase integrates into accessible DNA, and these fragments are then directly amplified and sequenced. h, Scheme for allele-specific ATAC–seq data analysis. i, SNP-mapping simulation. For each SNP location on the X chromosome, all overlapping 50 bp reads were extracted (50 total) for each of the 129 and Cast alleles. All reads were then processed through the Hi-C and ATAC–seq mapping pipelines described in the Methods to measure assignment accuracy. Results are shown in the table.

Extended Data Figure 2 Structure of the Xa chromosome in ES cells, compartment analysis and characterization of interactions between genes that escape XCI.

a, Allele-specific Hi-C contact maps for X chromosome in ES cells at 500-kb resolution (top), and for a ~40-Mb region centred around the DXZ4-containing locus at 40-kb resolution (bottom). The insulation score is plotted at the bottom of each 40-kb heat map as in Fig. 1a. Purple shaded areas indicate the IQR of insulation scores along the chromosome. b, Zoomed in view of three regions showing Hi-C interactions, RNA-seq and ATAC–seq signal on the Cast and 129 Xa chromosomes in ES cells. c, Compartment profiles of X chromosome in ES cells and NPCs. The first eigenvector (PC1) of each allele-specific Hi-C contact map, obtained with principal component analysis, is shown, together with the difference in chromosome-wide insulation score between the 129 and Cast allele. A/B-compartments are evident in ES cells and NPCs along both Xa (red and blue signal), whereas the first eigenvector corresponds to the two mega-domains for the Xi chromosome in NPCs. In ES cells, both Xa chromosomes display comparable insulation profiles (difference is close to 0 along the chromosome), whereas in NPCs large differences are observed (difference in insulation fluctuates along the chromosome). Grey areas indicate regions with low SNP density that were excluded from analysis. d, Same plots as in Fig. 2e for the same set of genes that escape XCI in NPCs, but using Hi-C data obtained in ES cells. e, As in d, where the interactions between loci that escape XCI on the wild-type Xi chromosome were divided in three different groups, according to whether pairs of loci belong to the same or different mega-domains. Interactions across the mega-domain boundary only occur on the wild-type Xi chromosome and are lost on the ΔFT Xi chromosome concomitant with loss of transcription of constitutive escapees.

Extended Data Figure 3 Comparison of allele-specific Hi-C maps of chromosomes X and 13 in ES cells, NPCs and boundary-mutant ΔFT NPCs.

a, Hi-C data, insulation scores, and the difference in insulation scores (129 Cast) are shown for ES cells, NPCs and mutant ΔFT NPCs for both alleles (Cast and 129) for the X chromosome. Large dips in the insulation vector are found at TAD boundaries. Peaks in the insulation vector are found towards the centre of each TAD. The insulation difference plot highlights areas of differential TAD structure between the alleles (many differences along the X chromosome in NPCs as compared to the allelic differences along autosomes, see b). b, Same as a for chromosome 13. The insulation difference plot highlights areas of differential TAD structure between the alleles (rare).

Extended Data Figure 4 Quantitative analysis of 3D DNA FISH on the Xa and Xi chromosomes.

a, Top left, scheme of the procedure used to quantify the Pearson correlation. A background is generated for each xy plane in a three-dimensional z-stack by morphological opening the image with a circle of 5 pixels in radius, and subtracted from it. Pearson correlation between red and green pixel intensities is measured inside a fixed-size region of 40 × 40 × 20 pixels (5.16 × 5.16 × 4 μm) centred on each FISH signal. To demonstrate that background subtraction does not affect the measured correlations, we show here a line-scan of 10 μm across a typical DNA FISH signal (top right). The shape of the signals along the line scan, as well as their relative intensities, is not affected by background subtraction (bottom). Pre- and post- refer to line scans taken on images before and after background subtraction, respectively. b, In more than 80% of nuclei in NPCs, Pearson correlations are higher on the Xi than on the Xa chromosome. Shown is NPC clone C2 (the same where Hi-C was performed). c, Same quantification as in Fig. 1c (and b) for an independent NPC clone (E1) where the active X is on the 129 allele and the inactive X chromosome on the Cast, and in astrocytes derived from NPC clone C2. n denotes the number of cells analysed in DNA FISH. Centre lines represent medians; crosses denote experimental points that were considered as outliers (larger than (q3 + 1.5 × (q3 − q1) or smaller than q1 − w(q3 − q1), where q1 and q3 are the twenty-fifth and seventy-fifth percentiles of the data distribution, respectively). d, Scheme of the gyration tensor based analysis of FISH volumes (see Methods). e, Left, gyration radii of DNA FISH signals from probes a, b and c. Probe b was used in combination with both probes a and c separately in two independent experiments. Statistical significance was assessed by Wilcoxon’s rank sum test (*P < 0.05, **P < 1 × 10⁻⁵). The mean gyration radii for Xa and Xi chromosome signals are indicated by dotted lines as a guide for the eye. On the Xi chromosome, the mean gyration radius is approximately 6% smaller than on the Xa chromosome corresponding to approximately 20% smaller volume. Right, representative images of probe a, showing smaller size and increased roundness of the Xi chromosome signals. f, Left, scheme of the thresholding-based method for volume quantification. Thirty increasing threshold levels were imposed, starting from the residual grayscale background level surrounding the signal, up to the minimum between the red and green channel grayscale maxima. For each of these thresholds we determined the number of voxels in each channel, where the greyscale intensity was higher than the threshold. Centre, the fraction of cells where the Xa chromosome signal is larger than the Xi chromosome is between 60% and 80% in the entire threshold range. Right, in a wide range of thresholds, the volume of Xa chromosome signals is approximately 25% bigger than Xi chromosome signals. Results are shown here for probes a and b; the same holds for probe c (not shown). n denotes the number of cells analysed in DNA FISH. Centre lines: medians. Boxes: middle 50% of data points. Two biological replicates were analysed for each experiment.

Extended Data Figure 5 Integrative analysis of allele-specific Hi-C, RNA-seq and ATAC–seq.

a, X-chromosome-wide ATAC–seq and RNA-seq in ES cells and NPCs. ATAC shows signal for ambiguous, 129- and Cast-specific reads in ES cells and NPCs. RNA-seq shows total signal as well as expressed gene calls. ATAC–seq shows global loss of chromatin accessibility and expression on the Xi chromosome, except at specific locations that mostly overlap with escape genes. Dotted line denotes mega-domain boundary. Position of constitutive escapees was adapted from ref. 3. b, Escape genes on the Xi chromosome (as determined by RNA-seq) fall within regions with high ATAC–seq signal (Kolmogorov–Smirnov test P < 2.2 × 10⁻¹⁶). c, Pie charts showing the distribution of peaks that escape XCI versus the peaks that are unique to the Xa chromosome. Peaks are classified into those that are promoter-proximal (within 5 kb of TSS) and distal (>5 kb from TSS). Annotations are based on binding sites identified by ChIP–seq^37,38. d, Scatter plot showing the relationship between TAD signal (insulation score) and number of expressed genes of the wild-type NPC 129 (Xi) Hi-C data set. The y axis shows the mean insulation score in a 480-kb window with a 40-kb step size. The x axis shows the number of expressed genes in a 480-kb window with a 40-kb step size. The Pearson R value (0.395) is shown above and is calculated on only the middle 90% (red points) of the residuals. Outliers are shown in black (bottom 5% and top 5% of residuals)

Extended Data Figure 6 Analysis of wild-type and boundary-mutant ΔFT NPCs by FISH and RT–PCR followed by pyrosequencing.

a, Scheme of the strategy used to delete the mega-domain boundary region in ES cells and to derive ΔFT NPCs. b, RNA FISH against constitutive and facultative escapees confirms RNA-seq and ATAC–seq results in the ΔFT NPC clone D9B2. Top, the positions of BAC probes (RP23-328M22 and RP23-436K) are shown relative to the escape genes that they span. Coloured gene names correspond to transcripts that were detected with specific fosmid probes. Bottom, sample RNA FISH images showing that expression of facultative (Mecp2 and BAC probes) but not constitutive (Jarid1c) escapees is lost on the ΔFT Xi chromosome. c, Quantification of the RNA FISH experiment in b. d, RT–PCR followed by pyrosequencing of Xist, two facultative escape genes (Huwe1 and Mecp2) and the constitutive escapee Jarid1c indicate the percentage of expression from the 129 allele (red) or the Cast allele (blue) in 56 wild-type and 17 ΔFT NPC clones. Absence of Mecp2 and Huwe1 escape is observed in a moderately higher proportion of ΔFT than wild-type clones. Absence of Jarid1c escape is never observed.

Extended Data Figure 7 Allele-specific Hi-C analysis of wild-type and boundary-mutant ΔFT NPCs.

a, Hi-C data, insulation scores, and the difference in insulation scores are shown to compare the wild-type Xi chromosome (NPC 129) and the ΔFT Xi chromosome (NPC 129). Top, the Cast allele (Xa chromosome) for both samples. Bottom, the 129 allele (Xi chromosome) for both samples. Large dips in the insulation vector are indicative of TAD boundaries. Peaks in the insulation vector are found towards the centre of each TAD. The insulation difference plot highlights areas of differential TAD structure between the wild-type and ΔFT NPCs. b, Cumulative plots of TAD strength at the wild-type expressed versus the wild-type silenced genes on Cast and 129 chromosomes, for ES cells, wild-type and ΔFT NPCs. Escapee genes on the Xi chromosome (NPC 129, ΔFT NPC 129) show higher insulation scores as compared to silenced genes.

Extended Data Figure 8 3D DNA FISH and integrative ATAC–seq/RNA-seq analysis in ΔFT NPCs.

a, Top left, scheme of the DNA FISH probe sets (a–b: inside the same mega-domain, b–c: across the boundary). Bottom, loci detected by probe set b–c are more interacting in the ΔFT than in the wild-type Xi chromosome both in Hi-C (left) and in 3D-DNA FISH (right), showing loss of mega-domain boundary. *P < 2 × 10⁻⁴, **P < 1 × 10⁻⁵ (Wilcoxon’s rank sum test corrected with Bonferroni for multiple hypothesis testing). Right, sample RNA/DNA FISH images showing that signals from probe set b–c are more overlapping on the ΔFT Xi than on the wild-type Xi chromosome. Two biological replicates were analysed. b, Chromosome-wide ATAC–seq and RNA-seq signal generated with ambiguous (mm9), 129- and Cast-specific reads in wild-type NPCs and ΔFT NPCs, showing global loss of chromatin accessibility on the ΔFT Xi chromosome except at the X-inactivation centre (Xic) region and constitutive escape genes. Dotted line indicates the position of the mega-domain boundary. c, Zoomed-in view of a region on the ΔFT Xi chromosome encompassing the Xic showing Hi-C interactions, RNA-seq and ATAC–seq signal. ATAC–seq from wild-type NPCs is included for reference (previously shown in Fig. 1).

Extended Data Figure 9 3D DNA FISH and integrative Hi-C/ATAC-seq analysis of inducible Xist ES cell lines.

a, Left, schematic representation of TXY and TXY:ΔA male cell lines, carrying a tetracycline-inducible wild-type and A-repeat mutant Xist, respectively, at the endogenous Xist locus. Centre right, RNA FISH against G6pdx and a group of genes recognized by the RP23-436K BAC (see Extended Data Fig. 6b) showing that expression of X-chromosome-linked genes is lost upon induction of wild-type but not A-repeat mutant Xist in male ES cells. TXY and TXY:ΔA were treated with doxycycline for 48 h. b, Representative images from RNA/DNA FISH experiments in TXY and TXY:ΔA male ES cell lines, showing increased overlap of probes a–b on the Xist-coated X chromosome after 48 h of wild-type, but not ΔA Xist induction. c, Left, schematic representation of TX1072 female ES cells in which Xist expression can be induced via a tetracycline-responsive promoter at one of the two endogenous Xist loci. Center right, RNA/DNA FISH in TX1072 cells treated for 3 days with doxycycline. Probes a–b overlap more on the Xist-coated than on the wild-type X chromosome, whereas signals from b–c show lower overlap and partitioning of the Xist-coated chromosome into two separate domains. *P < 1 × 10⁻⁷ (Wilcoxon’s rank sum test corrected with Bonferroni for multiple hypothesis testing). n denotes the number of cells analysed in DNA FISH. Centre lines: medians. Boxes: middle 50% of data points. Two biological replicates were analysed. d, Correlation between changes in Hi-C interactions and ATAC–seq signal following 48-h wild-type Xist induction in male ES cells (TXY cell line). log₂(+dox/−dox) was calculated for bins of 40 kb (left), 100 kb (middle), and 500 kb (right). e, Model of mega-domain boundary-mediated control of chromosome folding and facultative escape. Xist coating causes gene silencing and initiates chromosome-wide conformational changes in a manner dependent on its A-repeat region, which result in formation of mega-domains, overall compaction of chromosome folding, and loss of TADs. During differentiation, transient interactions with the mega-domain boundary may occur and may contribute to facultative escape and re-establishment of TADs at facultative escape loci, although the presence of residual TADs at escapee loci on the Xi chromosome is not necessarily due to the mega-domain boundary.

Supplementary information

Supplementary Table

This table contains all relevant RNA-Seq data for all locations (genes) along the X chromosome. The table columns are as follows: 1, xloc; 2, chr; 3, start; 4, end; 5, gene; 6, B129T3__129S1__category; 7, B129T3__129S1__pval; 8, B129T3__129S1__reads; 9, B129T3__129S1__rpkm; 10, B129T3__129S1__status; 11, B129T3__CAST__category; 12, B129T3__CAST__pval; 13, B129T3__CAST__reads; 14, B129T3__CAST__rpkm; 15, B129T3__CAST__status; 16, GEI.72b__129S1__category; 17, GEI.72b__129S1__pval; 18, GEI.72b__129S1__reads; 19, GEI.72b__129S1__rpkm; 20, GEI.72b__129S1__status; 21, GEI.72b__CAST__category; 22, GEI.72b__CAST__pval; 23, GEI.72b__CAST__reads; 24, GEI.72b__CAST__rpkm; 25, GEI.72b__CAST__status; 26, GUR.2d__129S1__category; 27, GUR.2d__129S1__pval; 28, GUR.2d__129S1__reads; 29, GUR.2d__129S1__rpkm; 30, GUR.2d__129S1__status; 31, GUR.2d__CAST__category; 32, GUR.2d__CAST__pval; 33, GUR.2d__CAST__reads; 34, GUR.2d__CAST__rpkm; 35, GUR.2d__CAST__status. xloc is a numerical ID for each gene location. chr is the chromosome. start is the start position of the gene. end is the end position of the gene. (for positions, start > end, not re-oriented by strand) gene is the gene name. The remaining columns are broken down into groups of 5 per sample, per allele. NNNN is the sample name. ESC = GUR.2d; WT NPC = GEI.72b; ΔFT NPC = B129T3 (D9B2). XXXX is the allele, 129S1 for the 129 allele, CAST for the Cast allele. The five columns are: NNNN__XXXX_category, category assignment of expression (bi,mono,biased,na, see ref 20) NNNN__XXXX__pval, p-value of the allelic assignment. NNNN__XXXX__reads, number of allelic reads. NNNN__XXXX__rpkm, RPKM value for the allelic gene. NNNN__XXXX__status, expression status of the gene, expressed or silenced. We defined expressed as ≥ 3 RPKM. (XLSX 5591 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

Rights and permissions

Reprints and permissions

About this article

Cite this article

Giorgetti, L., Lajoie, B., Carter, A. et al. Structural organization of the inactive X chromosome in the mouse. Nature 535, 575–579 (2016). https://doi.org/10.1038/nature18589

Download citation

Received: 24 July 2015
Accepted: 18 May 2016
Published: 18 July 2016
Issue Date: 28 July 2016
DOI: https://doi.org/10.1038/nature18589
Springer Nature Limited

This article is cited by

Computational methods for analysing multiscale 3D genome organization
- Yang Zhang
- Lorenzo Boninsegna
- Jian Ma
Nature Reviews Genetics (2024)
YY1 binding is a gene-intrinsic barrier to Xist-mediated gene silencing
- Joseph S Bowness
- Mafalda Almeida
- Neil Brockdorff
EMBO Reports (2024)
Replication dynamics identifies the folding principles of the inactive X chromosome
- Rawin Poonperm
- Saya Ichihara
- Ichiro Hiratani
Nature Structural & Molecular Biology (2023)
LncRNA XIST regulates breast cancer stem cells by activating proinflammatory IL-6/STAT3 signaling
- Yuxi Ma
- Yongyou Zhu
- Ming Luo
Oncogene (2023)
A comparison of topologically associating domain callers over mammals at high resolution
- Emre Sefer
BMC Bioinformatics (2022)

Associated content

The 3D genome

Collection 02 September 2019

Editorial Summary

Inactive X chromosome structure

During female development, X-chromosome inactivation is triggered by upregulation of the non-coding Xist RNA from one of the two X chromosomes. Chromosome conformation capture approaches have shown a loss of local structure on the inactive X (Xi) and formation of large mega-domains, separated by a region containing the DXZ4 macrosatellite. These authors investigate the structure, chromatin accessibility and expression status of the mouse Xi using allele-specific Hi-C, ATAC–seq and RNA–seq in embryonic stem cells and neural progenitor cells (NPCs). The Xi in NPCs lacks topologically associating domains (TADs) except around genes that escape X-chromosome inactivation, suggesting that TAD formation is driven by gene activity. The DXZ4-containing region and Xist shape the mega-domain structure of the Xi.

Structural organization of the inactive X chromosome in the mouse

Abstract

Similar content being viewed by others

Main

Methods

Cell culture

Boundary deletion

RT–PCR and pyrosequencing

Hi-C read mapping, binning, ICE correction

Allele-specific read mapping validation (Hi-C and ATAC–seq)

Generation of Xist-positive Hi-C signal for comparison with DNA FISH

Hi-C SNP density filter

Compartment analysis

Insulation and boundary calculation

Interaction pile-up maps

The Xi chromosome is as accessible and detectable in Hi-C as the Xa chromosome and autosomes

Correlation analysis of Hi-C matrices

RNA and 3D-DNA FISH

RNA FISH probes

DNA FISH probes

Imaging and quantification of 3D DNA FISH

RNA-seq

RNA-seq ‘expressed/escapee’ classification

ATAC–seq

Assigning allele-specific ATAC–seq peaks

Annotating ATAC–seq peaks using ChIP–seq data

Integrating Hi-C, ATAC–seq and RNA-seq data

Accession codes

Primary accessions

Gene Expression Omnibus

Data deposits

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Extended data figures and tables

Supplementary information

PowerPoint slides

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation