Abstract
Cell fate decisions remain poorly understood at the molecular level. Embryogenesis provides a unique opportunity to analyze molecular details associated with cell fate decisions. Works based on model organisms have provided a conceptual framework of genes that specify cell fate control, for example, transcription factors (TFs) controlling processes from pluripotency to immunity1. How TFs specify cell fate remains poorly understood. Here we report that SALL4 relies on NuRD (nucleosome-remodeling and deacetylase complex) to interpret BMP4 signal and decide cell fate in a well-controlled in vitro system. While NuRD complex cooperates with SALL4 to convert mouse embryonic fibroblasts or MEFs to pluripotency, BMP4 diverts the same process to an alternative fate, PrE (primitive endoderm). Mechanistically, BMP4 signals the dissociation of SALL4 from NuRD physically to establish a gene regulatory network for PrE. Our results provide a conceptual framework to explore the rich landscapes of cell fate choices intrinsic to development in higher organisms involving morphogen-TF-chromatin modifier pathways.
Similar content being viewed by others
Introduction
How a single cell, the fertilized egg, gives rise to all cells in an individual remains an unresolved question2. Intuitively, the entire process is quite simple, i.e., a series of cell divisions giving rise to the total number and types of cells in an individual, such as the estimated two hundred trillion cells in a human being3. The binary process of doubling has been well understood under the framework of cell cycle4. How cell fate decision is specified at each division remains largely unknown at the molecular level. Genes involved in cell fate decisions have been identified in model organisms mostly. For example, the complete cell lineage of C. elegans enabled the identification of genes involved in an extreme case of cell fate decision5, cell death, which laid the foundation of subsequent biochemical delineation of the cell death pathway6. the fruit fly has provided an early link between chromosome aberration and developmental defects such as the homeotic mutations leading to the realization of developmental programming hardwired in the genome, i.e., the co-linearity of HOX clusters and body segmentations conserved in both invertebrates and vertebrates7,8,9. However, unlike cell death which can be modeled quite faithfully in vitro, developmental cell fate decisions to generate diverse cell types have not been modeled successfully in vitro. A fertilized mouse egg undergoes the first time cell fate decision, giving rise to trophectoderm or TE and inner cell mass or ICM, around E4.5-E5.5, mouse ICM makes the second fate decision to generate hypoblast or primitive endoderm and epiblast10,11. Compared to trophectoderm and epiblast, primitive endoderm is rarely analyzed at molecular details like the epiblast, which ESCs serve as a faithful in vitro copy, but could be as critical as the other cells in embryogenesis. For example, PrE releases the nodal signaling molecule, which affects the anterior-posterior axis specialization process during embryonic gastrulation12. Knockout of PrE related genes, for instance, Sox17 and Lama1, arrests embryonic development13,14. Recent studies performed on blastoids indicate that PrE over-differentiation might cause intrauterine development lethality15. However, the underlying mechanism of PrE lineage specification remains unknown. BMP signals have been reported to play an important role in PrE based on studies using dominant negative forms of BMP receptor 2 and small molecule antagonists16. Similarly, Sall4 is required for the development of the epiblast and primitive endoderm17, Sall4 KO embryos arrest around E6.5 slightly later than the embryo implantation process18, a stage controlled by PrE. However, the relationship between Sall4 and Bmp4 has not been reported so far in any cell fate decision process. Here we report an in vitro model of cell fate decision whereby BMP4 specifies PrE fate by dissolving the pluripotent-bound SALL4-NuRD complex19. This system may allow detailed biochemical analysis to delineate the molecular mechanisms associated with cell fate decisions.
Results
BMP4 blocks pluripotency induction
We have recently shown that JGES(Jdp2-Glis1-Esrrb-Sall4) can convert E13.5 mouse embryonic fibroblasts or MEFs to naive pluripotency (~E3.5 inner cell mass or ICM) in a NuRD-dependent manner20. We then wished to improve this process by testing various factors previously known to enhance reprogramming. We surprisingly find that BMP4, a component of the TGF-b superfamily, previously shown to improve OSKM reprogramming21,22, inhibits JGES reprogramming dramatically (Fig. 1a–c). This unexpected finding suggests that BMP4 might have diverted the cell fate trajectory away from pluripotency along the reprogramming pathway. To gain further insight into this process, we ask if BMP4 inhibition is time-dependent and show that there is a window of sensitivity in the first 3 days (Fig. 1d). We also show that the system is dose-dependent (Fig. 1e, f), with ~1 ng/ml capable of inhibiting ~50%. These results demonstrate that BMP4 blocks JGES reprogramming to pluripotency in a time and dose-dependent manner.
We then try to address whether BMP4 relies on the known downstream regulators such as the SMADs, a family of proteins known to mediate TGF-b super-family signaling either positively or negatively23. Specifically, we show that Smad6 and Smad7, inhibitory or I-Smad antagonize BMP4 effectively and restore reprogramming24 (Supplementary Fig. 1a, b). It is of interest to note that Smad6 has a better rescue efficiency than Smad7 as it is more specific to BMPs25. These results suggest that as a classic morphogen, BMP4 specifies the choice between pluripotent and alternative fates during JGES reprogramming.
BMP4 diverts cell fate toward extra-embryonic lineages
To identify the fate trajectory diverted by BMP4, we performed single-cell RNA-seq on JGES reprogrammed cells at day 7 with or without BMP4 as illustrated (Fig. 2a) and show that there is significant overlap among various intermediates and endothelial cells between BMP4− and BMP4+ samples (Fig. 2b, c and Supplementary Fig. 2a). However, there is a clear separation of the pluripotent cluster in BMP4− vs. primitive endoderm cell-like cells (PrECLCs) cluster in BMP4+ population (Fig. 2b, c). In addition, there are also minor clusters specific for placenta-like cells in the BMP4+ population (Fig. 2c).
Detailed analysis further reveals four major clusters with representative marker genes labeled on the top (Fig. 2d), including (1) the intermediate cluster is related to ossification and kidney development, (2) the endothelial cluster is related to vasculogenesis and endothelial development, (3) the placenta-like cluster is related to placenta development and epithelial cell morphogenesis, and finally (4) the PrECLCs cluster is related to pattern specification process and endoderm development (Fig. 2e). Interestingly, among the clusters in BMP4+ and BMP4− cells at day 7, ~20% of cells are either PrECLCs or iPSCs respectively (Fig. 2f). It is of interest that, in the absence of BMP4, both PrECLCs and endothelial-like cells can be identified, albeit at much lower frequencies, 0.016% and 0.058% (Fig. 2f), suggesting that the JGES reprogramming is capable of generating quite diverse cell types without BMP4.
To further define the mechanism specifying pluripotency vs. PrE16,17, we screen factors for PrECLCs based on public internal datasets (Fig. 2g). By qPCR (Supplementary Fig. 2b) and bulk RNA-seq (Supplementary Fig. 2c), we show that genes such as Sox17 and Gata4/6 are highly enriched in PrECLCs26,27. To see if these genes play any potential role in the bifurcating decision between pluripotent and PrE fates, we tested each gene in JGES and showed that Gata4 is a critical inhibitor in blocking pluripotent reprogramming (Fig. 2h).
We further validated several critical PrE markers by immunofluorescence and show that GATA4+/LAMA1+ clones are present in JGES reprogramming28 (Fig. 2i). Indeed, we can identify PrECLCs clones at day 9 in both BMP4− and BMP4+ JGES reprogramming (Supplementary Fig. 2d) with clear boundaries, plump cell morphology, and very condense extracellular matrix, a critical characteristic of primitive endoderm29. In an effort to match these in vitro generated PrECLCs with mouse embryonic cells, we compared them to those reported in E4.5–E5.5 embryos (Supplementary Fig. 2e, f) and show that, indeed PrECLCs cluster with primitive endoderm (PrE) and pluripotent cells cluster with epiblast. Additionally, we find that PrECLCs are closer to PrE than parietal endoderm (PaE) or vesical endoderm (VE) in vivo (Supplementary Fig. 2g), confirming them as PrE cell-like cells or PrECLCs. In summary, the JGES reprograming system could reset MEFs into pluripotent states or alternative fates, such as those from an extra-embryonic lineage, in a BMP4-sensitive manner.
BMP4 targets SALL4 to specify alternative fates
The fact that BMP4 clearly inhibits JGES reprogramming suggests that it mediates cell fate decisions in a TF-dependent manner. Indeed, we show that BMP4 dramatically enhances OS (Oct4+Sox2) reprogramming efficiency in iCD3 as previously described30 (Fig. 3a), thus ruling out any role iCD3 may play in the inhibitory effect. We then focused on each individual TF by performing drop-out experiments with Jdp2, Glis1, Esrrb and Sall4 as illustrated (Fig. 3b), and show that we can rule out Jdp2 and Glis1, but not Esrrb and Sall4 for they are both important in pluripotency induction (Fig. 3c). Since dropping either Esrrb or Sall4 lowers reprogramming efficiency to such a negligible level, we introduce Oct4, a factor not involved in BMP4-mediated inhibition but important in pluripotency induction, to JGES, and show that BMP4 remains capable of blocking JGESO reprogramming by reducing the efficiency by ~70% (Fig. 3d). We repeated the dropout experiments with JGESO and show that Sall4 is the only factor conferring sensitivity to BMP4, remarkably, dropping out Sall4 in fact renders the remaining JGEO responsible to BMP4 positively (Fig. 3e). BMP4 also enhances OS + JGE reprogramming as expected (Fig. 3f). Alternatively, we tested each of JGES one by one in OS and show that BMP4 enhances reprogramming, except when Sall4 is added (Fig. 3g). When Sall4 is added to OS, BMP4 become an inhibitor of reprogramming, and even when J, G, and E are present alone or together (Fig. 3h). These results demonstrate clearly that BMP4 targets Sall4 to block iPSC reprogramming.
To clarify BMP4–SALL4 axis in pluripotency inhibition and PrE formation, we perform bulk RNA-seq on JGE, JGES, JGEO, JGESO under BMP4+ and BMP4− at day7, when SALL4 exists, BMP4 exhibits inhibition effect on pluripotent genes and promotion effect on PrE genes (Supplementary Fig. 3a–d). We repeated the same set of dropout experiments with BMP4 treatment again, qPCR results show that Jdp2 is the only factor to inhibit PrE cell fate, the other three factors, including Sall4, work cooperatively to enhance PrE cell fate (Fig. 3i). On the other hand, when adding JGES one by one to OS under BMP4 treatment, Sall4 turns out to be the only factor that enhances PrE gene expression (Fig. 3j). These results suggest that the BMP4–SALL4 axis acts synergistically to impede pluripotency, and promote PrE fate.
BMP4 dissociates SALL4 from NuRD
We have previously shown that NuRD is important to orchestrate iPSC reprogramming in JGES, in contrast to its reported role as a barrier in OKSM reprogramming. We further reported that NuRD is recruited by SALL4 to close somatic loci via its N-terminal 12 AA residues20. To test if BMP4 may disrupt the NuRD–SALL4 axis, we performed IP-MS on SALL4 in BMP4+ and BMP4− groups on day 3 (Fig. 4a) and show, by volcano map, that SALL4-NuRD interaction is disrupted (Fig. 4b, Supplementary Data 1). Components of NuRD are significantly down-regulated in BMP4+ IP-MS experiments (Fig. 4b) and confirmed in Co-IP experiments (Fig. 4c). These results indicate that BMP4 may block iPSC reprogramming by disrupting the SALL4–NuRD axis.
To directly test if the disrupted cooperation plays a crucial role, we constructed inducible fusion constructs between SALL4 and three components of NuRD, i.e., GATAD2B, MTA1 and MBD3 as illustrated in (Supplementary Fig. 4a). Interestingly, while SALL4-MTA1 or MBD3 has minimal effect, SALL4-GATAD2B can significantly enhance iPSC generation, furthermore, when SALL4-GATAD2B, SALL4-MTA1, SALL4-MBD3 fusion constructs are expressed together at the first three days, the reprogramming efficiency could be enhanced in a synergistically fashion (Fig. 4d), (Supplementary Fig. 4b). These results suggest that covalent fusion between SALL4-NuRD can partially restore iPSC reprogramming in the presence of BMP4.
We also tested the SALL4-NuRD cooperation in an alternative way. We took advantage of our earlier finding that the N terminal 12AAs (N12) of SALL4 plays a critical role in cooperation with NuRD in reprogramming. We tested the effect of BMP4 on N12-JDP2, which was previously shown to be able to rescue mutants such as SALL4K5A, along with Glis1 and Esrrb. As shown in Fig. 4e, BMP4 could also inhibit the reprogramming efficiency of this system by dissociating N12-JDP2 and GATAD2B interaction (Supplementary Fig. 4c). These results suggest that BMP4 blocks reprogramming by disrupting the N12–NuRD interaction. In fact, we show that BMP4 only improves reprogramming when SALL4 no longer can interact with NuRD comparing reprogramming efficiency between SALL4WT with SALL4delN12 and SALL4K5A. (Fig. 4f), While SALL4delN12 fails to promote iPSC generation (Fig. 4g, h), it can also effectively enhance PrE-related gene expression such as Gata4 (Fig. 4i).
BMP4 activates PrE regulons
BMP4 is critical for many physiological functions. To probe its role in our system, we performed regulon analysis with our scRNA-sequencing data, and identified five top regulons centered on Sox17, Pitx1, Klf4, Gata4, and Foxa2 in PrECLCs (Fig. 5a). Interestingly, we show that PrE genes are activated by Gata4 with the rest of the top 5 regulons indicated (Fig. 5b–d). Interestingly, Gata4 is restricted in the PrECLCs cluster. When overexpressed, Gata4 is the most robust regulon in blocking the pluripotent fate (Fig. 5e). Consistently, qPCR results show that Gata4 activates PrE genes significantly more than the other four factors, while Sox17 and Foxa2 only elevate PrE gene expression slightly (Fig. 5f). Knocking down Gata4 leads to PrE genes down-regulation in JGES (Supplementary Fig. 5a, b). Furthermore, we also show that Gata4 can be elevated by BMP4 (Supplementary Fig. 2b). The mutant SALL4delN12, unable to recruit NuRD, can activate PrE genes expression more than SALL4WT, thus, phenocopying BMP4 (Fig. 4g–i). These results, taken together, suggest that BMP4 activates PrE fate at the expense of pluripotent one through the SALL4–NuRD axis.
Induction of PrE by SALL4delN12 alone
The fact that dissociation of SALL4 from NuRD by BMP4 diverts reprogramming away from pluripotent to PrE suggests that Sall4 plays a central role in the fate decision between epiblast- vs. hypoblast-fates. We further hypothesize that Sall4 alone may be able to specify PrE fate. Previous studies have shown that Esrrb has the ability to reset MEF cells to an induced extra-embryonic endoderm (iXEN) state31. We also have preliminary evidence that Sall4, Esrrb and Glis1 work cooperatively in PrECLCs induction (Fig. 3i). To test the direct relation between Sall4 and PrE cell fate, we infected MEFs with single factor SALL4WT and SALL4delN12 and show that GATA4 and LAMA1 double positive PrECLCs clones can emerge from both SALL4WT and SALL4delN12 at day 1132 (Fig. 6a). To distinguish single factor-induced PrECLCs from JGES induced PrECLCs, we name them iPrEs. However, the earliest iPrE clones appear in SALL4delN12 at D3 compared to SALL4WT at D9 (Fig. 6b). We performed in situ immunofluorescence experiments of GATA4 and show that SALL4delN12 is indeed more robust in iPrE induction than SALL4WT (Fig. 6c, d). These results are consistent with the scRNA-sequence data, in which JGES BMP4− group has a very rare PrECLC population while JGES BMP4+ has a much greater PrECLC population, perhaps as a result of dissociation of SALL4-NuRD by BMP4. We further show by bulk RNA-seq that the PrE markers are expressed in SALL4delN12 at a higher level than in SALL4WT (Fig. 6e, f and Supplementary Fig. 6a).
In order to estimate the characteristics of iPrE, we then construct Sall4WT-Flag and Sall4delN12-Flag fusion proteins into retroviral vector PMXs, to monitor their expression during iPrE generation and show that both are silenced at D11. Immunofluorescence by FLAG or SALL4 antibodies to detect exo- or whole SALL4 expression suggests that exo-SALL4 is silenced in iPrE colonies and endo-SALL4 slightly activated33 (Supplementary Fig. 6b). The iPrE cells could be passaged and exhibit robust proliferation in vitro (Supplementary Fig. 6c). Furthermore, the iPrEs generated by SALL4delN12 proliferate better than those by SALL4WT. We also show that in suspended culture, iPrE cells form spherical structures with monolayer cavities indicative of polarity and condensed extracellular matrix, mimicking the PrE property in vivo34. Interestingly, these spherical structures can be passaged by trypsin (Supplementary Fig. 6d).
We then performed blastocyst injection experiments to evaluate iPrE developmental potential27 by marking SALL4WT induced iPrE and MEF cells with GFP (Supplementary Fig. 6e) and injecting them into E3.5 blastocysts. The MEFs-GFP cells disappear within 48 h, while iPrEs-GFP remains viable in vitro under identical conditions (Supplementary Fig. 6f). When injected blastocysts shown in (Supplementary Fig. 6g) were implanted into surrogate female mice, we can detect GFP-positive cells in embryo yolk sac in iPrEs-GFP group but none in MEFs-GFP group at E12.511,35 (Supplementary Fig. 6h, i), suggesting that iPrE is capable of integrating into extra-embryonic tissues.
To further characterize SALL4WT and SALL4delN12 iPrE cells, we performed ATAC-sequencing and Cut&Tag experiments and showed that chromatin loci with motifs from FOX, SOX, GATA and KLF family are opened significantly higher in SALL4delN12 than SALL4WT (Fig. 6g, Supplementary Fig. 6i). Results from SALL4 Cut&Tag experiments are quite similar between SALL4WT and SALL4delN12 for PrE genes (Fig. 6h, i), however, H3K27ac is more enriched in SALL4delN12 than SALL4WT among PrE genes, consistent with the fact that the former fails to recruit NuRD (Fig. 6j, k).
Discussion
We provide here an in vitro model system to analyze early cell fate decisions in development, the choice to becoming epiblast- or hypoblast-cells. Unlike the canonic developmental process of inner cell mass segregating into epiblasts and hypoblasts, we utilize a reprogramming approach, converting E13.5 MEFs to pluripotent iPSCs or PrECLCs. We demonstrate that BMP4 plays a crucial role in specifying PrEs away from iPSCs involving SALL4 and NuRD complex.
PrEs are much less understood compared to epiblasts as it lacks an in vitro equivalent as iPSCs or ESCs for epiblasts. But PrEs are emerging as critical as they provide a critical structural components and function of the extra embryonic tissues36,37,38. Regenerating PrE cells through reprogramming may provide not only a reliable source of these cells, but also a system to analyze their properties for hypoblast as iPSC or ESC for epiblasts.
Apart from the hypo- and epiblast models, we also uncovered a previously unrecognized cell fate decision axis, linking a classic morphogen BMP4, to a well-known developmentally critical transcription factor SALL4, then to a less well-understood player in development and cell fate control, the NuRD complex. It would be of great interest to see similar paradigms for cell fate decisions in normal development and cancer39.
Among the axis members, Sall4 has been shown to play an important role in the three key lineages, epiblast, hypoblast, and trophectoderm, during early embryogenesis40,41. Intriguingly, Sall4 seems to play critical roles in multiple lineages and potentially many cell fate decisions42, it is conceivable that BMP4 may further provide a signal to switch between various fates and mechanistic actions. This feature may become relevant to assign specific activity of Sall4 in carcinogenesis as reported previously. As Sall4 has been reported as a reprogramming factor for iPSC in several studies, our results highlight its role in PrE cell fate formation while introducing new inquiries into how SALL4delN12 or SALL4WT remodels MEF cell fate to PrE cell fate. Given the fact that Sall4 has been shown to take part in limb and genital cell development in ontogenetic process43,44,45, along with BMPs or TGF-b family who participate in the same processes46,47, the cooperative and antagonistic functional research on them are still areas yet to be developed in further studies.
Methods
Mice
Oct4-GFP (OG2) transgenic-allele-carrying mice (CBA/CaJ×C57BL/6J) were purchased from The Jackson Laboratory (Mouse strain datasheet: 004654). 129S4/SvJaeJ and ICR mice were purchased from the Beijing Vital River Laboratory. Animals were individually housed under a 12 h light/dark cycle and provided with food and water ad libitum, ambient temperature is about 20−26 °C and humidity is about 40–70%. Our studies followed the guidelines for the Care and Use of Laboratory Animals of the National Institutes of Health, and the protocols were approved by the Committee on the Ethics of Animal Experiments at the Guangzhou Institutes of Biomedicine and Health. OG2 mice and 129Sv/Jae mice were used to generate E13.5 mouse embryo fibroblast (MEFs) and ICR mice were used to apply donor blastocyst and pseudo-pregnant mice.
DNA constructs, cell lines and cell culture
pMXs plasmids, pKD plasmids, pLVX plasmids, pB plasmids were used for in vitro over-expression. pMXs (retrovirus vector) were regularly used if not extra mentioned, pKD was used to overexpress rtTA, pLVX carries a TRE3G (Tet-on system) promoter, pB plasmid coupled with pBase plasmid were used to overexpress GFP as indicated. MEFs were obtained from E13.5 mouse embryos regardless of sex by crossing male OG2 mice to 129S4/SvJaeJ/female mice, after removing the integral organs, the tail, the limbs and head, the remaining tissues were cut into small pieces and then digested (0.25% trypsin: 0.05% trypsin = 1:1; GIBCO) for 10 min at 37 °C to a single cell suspension. The isolated MEF cells were seed on 0.2% gelatin (home-made) coated dish, cultured in fibroblast medium: DMEM-high glucose (Hyclone) contains 10% FBS(NTC, SFBE, HK-026), 1% GlutaMAX (GIBCO), 1% sodium pyruvate (GIBCO), 1% NEAA (GIBCO) and 0.1 mM 2-mercaptoethanol (GIBCO). Plat-E cells were cultured in DMEM high-glucose media (Hyclone) supplemented with 10% FBS (NTC, SFBE, HK-026). iPrE cells derived from mouse embryonic fibroblast cells cultured in iCD3: DMEM-high glucose (Hyclone) contains TV, VC, CHIR-99021, bFGF, mLIF, SGC0946, GSK-LSD12HCL, Y27632. 1% sodium pyruvate (GIBCO), 1% non-essential amino acids (GIBCO), 1% GlutaMAX (GIBCO), 0.1 mM 2-mercaptoethanol (GIBCO), N2 (GIBCO), B27 (GIBCO). All the cell lines have been confirmed as mycoplasma contamination-free with the Kit from Lonza (LT07-318).
iPSCs generation
Plat-E cells and 293T cells were seeded at the concentration of 7.5 × 106−8.5 × 106 Cells per 10-cm dish 12–16 h before transfection uniformly, and cultured in high-glucose DMEM (HyClone, SH30022.01) supplemented with 10% FBS (NTC, SFBE, HK-026) medium. For each 10-cm dish, replacing the Plat-E cells or 293T cells medium with 9 mL fresh 10% FBS firstly, 10 μg DNA was added into 1 ml opti-MEM(GIBCO), mix the liquid immediately after adding 40 μl PEI (1 mg/ml, YEASEN, 40816ES08), for 293T transfection, PSPAX2, PMD2G (6 μg:4 μg) were extra used. After incubating for 10–15 min at room temperature, the mixture should be gently transferred onto the Plat-E cell or 293T cells. Replace the medium with 10 ml 10% FBS within 10–16 h. And then, the retrovirus should be collected twice, 48 and 72 h after transfection, lentivirus should be collected once, 48 h after transfection. The supernatant containing the virus was collected at each time by a syringe and filter through a 0.45 μm filter, 10 ml fresh 10% FBS medium was added to the Plat-E cells after the first collection, the virus can be stored at room temperature for 48 h. Thawing the frozen Passage 1 OG2 MEFs (mouse embryonic fibroblast cells) into a 6 cm dish with 10% FBS medium and cultured in a 5% CO2 incubator when conducting the transfection. Then, split the MEFs into a 24-well plate at 1.5 × 104 cell density per well before infection. MEF cells should be infected by retrovirus twice and lentivirus once. Mix the virus stock at proper volume (Jdp2:Glis1:Esrrb:Sall4 = 2:1:1:2) and one volume of fresh 10% FBS medium, then mix polybrene at a final concentration of 4 mg/ml, Y27632 at a final concentration of 5 μM before infection. The second virus infection should be conducted 24 h later, lentivirus were infected at the second time. After infection for 2 days, replace the medium with iCD3 or iCD3 plus BMP4 (RD systems, 314-BP-500), change the medium every 24 h, and observe the morphology change. GFP+ clones are captured by living cells station (NIKON, Bio Station CT) and counted by Image-J using particle analysis.
iPrE generation
MEF cells were seeded onto 24-well plate at 1.5×104 cell density per well, then infected with retrovirus for twice, 24 h each time, iCD3 was used to conduct the generation progress, iPrE will appear gradually. iPrE cells could be digested into single cells by 0.25% trypsin-EDTA, 37 °C, 5 min, along the passages, the extracellular matrix of iPrE cells become more condensed and take a longer digestion time, up to 15 min. iPrE cells could also be cultured in suspension strategy and form a single layer spherical cavity for the cell polarity. Cells could be passaged at a 1:5 ratio, and BMP4 could promote the proliferation of iPrE both in 2D and 3D culture.
Blastocyst injection and embryo transplantation
The iPrE clones (Oct4-GFP negative, with PrE clone morphology) induced by SALL4WT alone are picked by pipette at day 11, after 3 days the patches are digested into single cells or smaller patches by 0.25% trypsin, and after one or two extra-passage to deplete the non-induced cells, the iPrE cells are ready to be labeled by GFP. MEF cells and iPrE cells are transfected with pB-GFP-2A-puro and pBase (1:1) by lipo 3000, and cultured in iCD3.48 h after transfection, 1 μg/ml puromycin was used to remove the un-transfected cells for extra 48 h until the rest of the cells are all GFP positive. Recover the cells for an extra-passage and then digest them into single cells before injection. Donor blastocysts were isolated by M2 medium from the uterus of female ICR mice 3.5 days after coition, GFP positive cells were injected into the cavity of the blastocyst, 10 cells per embryo, and about 30 embryos were injected per group, one-third of the chimeric embryos were cultured in KOSM in vitro for 48 h to observe MEF cells and iPrE cells proliferation, the rest of the chimeric embryos were transplanted into the uterine horn of the pseudo-pregnant female mouse, about 7–10 chimeric embryos were transplanted into each side of the uterine, chimeric embryos were dissected at E12.5.
Flow cytometry
Cells were dissociated into single cells using 0.25% trypsin–EDTA and collected after centrifugation at 250×g for 5 min. After washing with PBS for once, the cell pellet was resuspended with cold PBS containing 0.1% BSA, followed by removing large clumps of cells using a cell strainer (BD Biosciences). The cells were then analyzed by an Accuri C6 flow cytometer (BD Biosciences). The GFP fluorescence cells were detected in the FITC channel. Data analysis was performed using FlowJo v.7.6.1.
Immunofluorescence
Cells growing on a 96-well dish were washed 3 times with PBS, then fixed with 4% PFA for 0.5 h, after washing 3 times, 10 min per time, by PBS and subsequently penetrated and blocked with 0.2% Triton X-100 and 3% BSA (1:1) for 0.5 h at room temperature. Then, the cells were washed 3 times, 10 min per time, and incubated with primary antibody diluted with 3% BSA for 2 h at room temperature or overnight at 4 °C. After 3 times washing in PBS, the cells were incubated for one hour in second antibodies diluted with 3% BSA at room temperature. After washing 3 times in PBS cells were then incubated in DAPI diluted by PBS for 2 min, plus 3 times washing in PBS. The following antibodies were used in this project: anti-Flag (Sigma Aldrich, F1804, 1:200), anti-SALL4(abcam, ab29112 1:200), anti-GATA4 (Santa Cruz Biotechnology, sc-25310, 1:200), anti-LAMA1(abcam, ab11575 1:200)
Co-immunoprecipitation and western blot
To perform co-immunoprecipitation, cells were digested with 0.25% trypsin and washed 3 times in PBS, whole cell extracts were prepared using lysis buffer (50 mM Tris pH 7.4, 200 mM NaCl, 10% Glycerol, 1% NP40, 1 mM EDTA) with freshly added 1× Complete Protease inhibitors (Sigma, 1187358001) and 1% PMSF, incubated for 20 min on ice and then 1 h at 4 °C on a rotation wheel. Soluble cell lysates were collected after maximum speed centrifugation at 4 °C for 15 min, part of the supernatant was kept as input, the remaining supernatant was incubated with anti-FLAG beads, DYKDDDDK (Thermo Fisher, A36797) overnight at 4 °C on a rotation wheel. Beads were then washed with wash buffer (50 mM Tris pH 7.4, 200 mM NaCl, 10% Glycerol, 0.01% NP40, 1 mM EDTA) 10 times by inverting the tube on ice. After complete removal of the cell wash buffer, immunoprecipitated proteins with FLAG beads were boiled at a 100 °C incubator in loading buffer (4% SDS, 10% 2-Mercaptoethanol, 20% Glycerol, 0.004% Bromophenol blue, 0.125 M Tris Ph 6.8) for 10 min, Whole protein extract was stored at −80 °C and avoid freeze and thaw cycle. To perform western blot, input or IP extract was analyzed by SDS–PAGE and then transferred to PVDF membrane (Millipore, 0.45 μm). After incubation with indicated antibodies, the membrane was exposed to X film. NuRD Complex Antibody Sampler Kit (CST, 8349T), anti-GATAD2B (Abcam, ab224391), anti-RBBP4 (Novusbio NB500-123), anti-FLAG (Sigma Aldrich, F1804 1:1000), anti-SALL4 (Abcam, ab29112), were used.
Processing of scRNA-sequencing data
The FASTQ files of single-cell libraries were generated from Illumina NovaSeq. The clean FASTQ files were aligned to the Mm10 genome with mouse gene annotation of Gencode vM21 version by STARsolo function of STAR (2.7.6a)48. Low-quality cells were filtered out by the number of unique molecular identifiers (UMIs) and total counts following the pipeline of Python package Scanpy49. Gene regulatory network (GRN) analysis: We performed GRN analysis using pySCENIC50. We obtained a regulon score for all cells of each transcription factor. The importance of transcription factors for each cell type was ordered by normalized enrichment score.
Bulk RNA-seq and data analysis
The RNA-seq reads were trimmed using Trim Galore (v0.6.4)51,52 and then mapped to the mm10 reference genome with HISAT2 (v2.2.1)53, and StringTie (v2.2.1)54 was used to quantify the transcription level of each gene in each sample into fragments per kilobase of exon model per million mapped reads (FPKM). GFOLD (v1.1.4)55 was used to perform differential expression analysis between conditions. The differentially expressed genes were identified with a gfold value >0.5 or less than −0.5.
ATAC-seq and data analysis
The ATAC-seq reads were trimmed by Trim Galore (v0.6.4) and then mapped to the mm10 reference genome using bowtie2 (v2.4.5)56, and SAMtools (v1.16.1)57 was used to remove the unpaired, low sequencing quality (mapq < 30) and the mitochondrial DNA mapped reads in the total mapped reads. The reads that lengths <50 base pairs (bp) were isolated for subsequent analysis. In order to make the data comparable between different sequencing depths, the signals were normalized to one million reads for each sample, and the values were further compressed into a binary format (bigWig) for downstream analysis and data visualization. Peak calling was performed using MACS (v1.4.2)58 with parameters as follows: -g mm --keep-dup all --nomodel --shiftsize 25.
Cut&Tag and data analysis
The CUT&Tag reads were trimmed by Trim Galore (v0.6.4) and then mapped to the mm10 reference genome using bowtie2 (v2.4.5). SAMtools (v1.16.1) was used to remove the repetitive, low sequencing quality (mapq < 30) and the mitochondrial DNA mapped reads in the total mapped reads. The values were further compressed into a binary format for downstream analysis and data visualization. Replicates were merged using samtools merge and then peak calling was performed using MACS (v1.4.2) with parameters as follows: -g mm --keep-dup 1 --nomodel --shiftsize 73. The signals were normalized to one million reads for each sample. Promoters were defined as regions ± 2 kb around transcription start sites (TSSs) of genes.
Motif analysis
Motif scans were performed using HOMER (v4.11.1)59 against the genome sequence of the given ATAC-seq peaks covered regions (summits ± 25 bp) with the following parameters: -size given -mask. HOMER used a hypergeometric test to determine the motif enrichment and also test the similarity between the motif we identified and known factors. Motifs that have p-value < 10−5 and enrichment score > 3 are presented in the plot.
Gene Ontology analysis
Functional annotation was performed using the clusterProfiler (v4.6.2)60. Gene Ontology terms for each functional cluster were summarized to a representative term, and adjusted p-values were plotted to show the significance.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The scRNA-Seq, bulk RNA-Seq, ATAC-seq and Cut&Tag data have been deposited in the Gene Expression Omnibus database under the accession code GSE242851. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the iProX partner repository61,62. with the dataset identifier PXD051433. Source data are provided with this paper.
References
Lambert, S. A. et al. The human transcription factors. Cell 172, 650–665 (2018).
Qiu, C. et al. Systematic reconstruction of cellular trajectories across mouse embryogenesis. Nat. Genet. 54, 328–341 (2022).
Khan, Y. S. & Farhana, A. in StatPearls (StatPearls Publishing Copyright © 2024, StatPearls Publishing LLC, 2024).
Le Bras, S., Rondanino, C., Kriegel-Taki, G., Dussert, A. & Le Borgne, R. Genetic identification of intracellular trafficking regulators involved in Notch-dependent binary cell fate acquisition following asymmetric cell division. J. Cell Sci. 125, 4886–4901 (2012).
Spector, M. S., Desnoyers, S., Hoeppner, D. J. & Hengartner, M. O. Interaction between the C. elegans cell-death regulators CED-9 and CED-4. Nature 385, 653–656 (1997).
Wilkinson, H. A., Fitzgerald, K. & Greenwald, I. Reciprocal changes in expression of the receptor lin-12 and its ligand lag-2 prior to commitment in a C. elegans cell fate decision. Cell 79, 1187–1198 (1994).
Gibson, G. Insect evolution: redesigning the fruitfly. Curr. Biol. 9, R86–89, (1999).
Gehring, W. J. Chance and necessity in eye evolution. Genome Biol. Evol. 3, 1053–1066 (2011).
Hajirnis, N. & Mishra, R. K. Homeotic genes: clustering, modularity, and diversity. Front. Cell Dev. Biol. 9, 718308 (2021).
Chowdhary, S. & Hadjantonakis, A. K. Journey of the mouse primitive endoderm: from specification to maturation. Philos. Trans. R. Soc. Lond. B Biol. Sci. 377, 20210252 (2022).
Filimonow, K. & de la Fuente, R. Specification and role of extraembryonic endoderm lineages in the periimplantation mouse embryo. Theriogenology 180, 189–206 (2022).
Varlet, I., Collignon, J. & Robertson, E. J. Nodal expression in the primitive endoderm is required for specification of the anterior axis during mouse gastrulation. Development 124, 1033–1044 (1997).
Igarashi, H. et al. Sox17 is essential for proper formation of the marginal zone of extraembryonic endoderm adjacent to a developing mouse placental disk. Biol. Reprod. 99, 578–589 (2018).
Miner, J. H., Li, C., Mudd, J. L., Go, G. & Sutherland, A. E. Compositional and structural requirements for laminin and basement membranes during mouse embryo implantation and gastrulation. Development 131, 2247–2256 (2004).
Liu, K. et al. Bilineage embryo-like structure from EPS cells can produce live mice with tetraploid trophectoderm. Protein Cell 14, 262–278 (2023).
Graham, S. J. et al. BMP signalling regulates the pre-implantation development of extra-embryonic cell lineages in the mouse embryo. Nat. Commun. 5, 5667 (2014).
Elling, U., Klasen, C., Eisenberger, T., Anlag, K. & Treier, M. Murine inner cell mass-derived lineages depend on Sall4 function. Proc. Natl Acad. Sci. USA 103, 16319–16324 (2006).
Sakaki-Yumoto, M. et al. The murine homolog of SALL4, a causative gene in Okihiro syndrome, is essential for embryonic stem cell proliferation, and cooperates with Sall1 in anorectal, heart, brain and kidney development. Development 133, 3005–3013 (2006).
Hu, G. & Wade, P. A. NuRD and pluripotency: a complex balancing act. Cell Stem Cell 10, 497–503 (2012).
Wang, B. et al. The NuRD complex cooperates with SALL4 to orchestrate reprogramming. Nat. Commun. 14, 2846 (2023).
Samavarchi-Tehrani, P. et al. Functional genomics reveals a BMP-driven mesenchymal-to-epithelial transition in the initiation of somatic cell reprogramming. Cell Stem Cell 7, 64–77 (2010).
Takahashi, K. et al. Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell 131, 861–872 (2007).
Hayashi, Y. et al. BMP-SMAD-ID promotes reprogramming to pluripotency by inhibiting p16/INK4A-dependent senescence. Proc. Natl Acad. Sci. USA 113, 13057–13062 (2016).
Padgett, R. W., Cho, S. H. & Evangelista, C. Smads are the central component in transforming growth factor-beta signaling. Pharm. Ther. 78, 47–52 (1998).
Estrada, K. D., Retting, K. N., Chin, A. M. & Lyons, K. M. Smad6 is essential to limit BMP signaling during cartilage development. J. Bone Min. Res. 26, 2498–2510 (2011).
Pedroza, M. et al. Self-patterning of human stem cells into post-implantation lineages. Nature 622, 574–583 (2023).
Ohinata, Y. et al. Establishment of mouse stem cells that can recapitulate the developmental potential of primitive endoderm. Science 375, 574–578 (2022).
Pijuan-Sala, B. et al. A single-cell molecular map of mouse gastrulation and early organogenesis. Nature 566, 490–495 (2019).
Zhang, Z., Zwick, S., Loew, E., Grimley, J. S. & Ramanathan, S. Mouse embryo geometry drives formation of robust signaling gradients through receptor localization. Nat. Commun. 10, 4516 (2019).
Chen, J. et al. BMPs functionally replace Klf4 and support efficient reprogramming of mouse fibroblasts by Oct4 alone. Cell Res. 21, 205–212 (2011).
Benchetrit, H. et al. Direct induction of the three pre-implantation blastocyst cell types from fibroblasts. Cell Stem Cell 24, 983–994.e987 (2019).
Lauberth, S. M. & Rauchman, M. A conserved 12-amino acid motif in Sall1 recruits the nucleosome remodeling and deacetylase corepressor complex. J. Biol. Chem. 281, 23922–23931 (2006).
Schorn, A. J., Gutbrod, M. J., LeBlanc, C. & Martienssen, R. LTR-retrotransposon control by tRNA-derived small RNAs. Cell 170, 61–71.e11 (2017).
Tosenberger, A. et al. A multiscale model of early cell lineage specification including cell division. NPJ Syst. Biol. Appl. 3, 16 (2017).
Stremmel, C. et al. Yolk sac macrophage progenitors traffic to the embryo during defined stages of development. Nat. Commun. 9, 75 (2018).
Egea, J. et al. Genetic ablation of FLRT3 reveals a novel morphogenetic function for the anterior visceral endoderm in suppressing mesoderm differentiation. Genes Dev. 22, 3349–3362 (2008).
Hermitte, S. & Chazaud, C. Primitive endoderm differentiation: from specification to epithelium formation. Philos. Trans. R. Soc. Lond. B Biol. Sci. 369, https://doi.org/10.1098/rstb.2013.0537 (2014).
Bergmann, S. et al. Spatial profiling of early primate gastrulation in utero. Nature 609, 136–143 (2022).
Tatetsu, H. et al. SALL4, the missing link between stem cells, development and cancer. Gene 584, 111–119 (2016).
Lim, C. Y. et al. Sall4 regulates distinct transcription circuitries in different blastocyst-derived stem cell lineages. Cell Stem Cell 3, 543–554 (2008).
Zhang, J. et al. Sall4 modulates embryonic stem cell pluripotency and early embryonic development by the transcriptional regulation of Pou5f1. Nat. Cell Biol. 8, 1114–1123 (2006).
Xiong, J. et al. Cooperative action between SALL4A and TET proteins in stepwise oxidation of 5-methylcytosine. Mol. Cell 64, 913–925 (2016).
Rodríguez-Pérez, F. & Rape, M. Unlocking a dark past. Elife 7, https://doi.org/10.7554/eLife.41002 (2018).
Mansour, A. A. et al. The H3K27 demethylase Utx regulates somatic and germ cell epigenetic reprogramming. Nature 488, 409–413 (2012).
Yamaguchi, Y. L. et al. Sall4 is essential for mouse primordial germ cell specification by suppressing somatic cell program genes. Stem Cells 33, 289–300 (2015).
Lozovska, A. et al. Tgfbr1 controls developmental plasticity between the hindlimb and external genitalia by remodeling their regulatory landscape. Nat. Commun. 15, 2509 (2024).
Yu, S. et al. BMP4 drives primed to naïve transition through PGC-like state. Nat. Commun. 13, 2756 (2022).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Wolf, F. A., Angerer, P. & Theis, F. J. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 19, 15 (2018).
Van de Sande, B. et al. A scalable SCENIC workflow for single-cell gene regulatory network analysis. Nat. Protoc. 15, 2247–2276 (2020).
Nanak, E., Vijayalakshmi, M. A. & Chadha, K. C. Segregation of normal and pathological human red blood cells, lymphocytes and fibroblasts by immobilized metal-ion affinity partitioning. J. Mol. Recognit. 8, 77–84 (1995).
Kechin, A., Boyarskikh, U., Kel, A. & Filipenko, M. cutPrimers: a new tool for accurate cutting of primers from reads of targeted next generation sequencing. J. Comput. Biol. 24, 1138–1143 (2017).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
Feng, J. et al. GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data. Bioinformatics 28, 2782–2788 (2012).
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, https://doi.org/10.1093/gigascience/giab008 (2021).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Heinz, S. et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010).
Wu, T. et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innovations (Cambridge) 2, 100141 (2021).
Ma, J. et al. iProX: an integrated proteome resource. Nucleic Acids Res. 47, D1211–d1217 (2019).
Chen, T. et al. iProX in 2021: connecting proteomics data sharing with big data. Nucleic Acids Res. 50, D1522–d1527 (2022).
Acknowledgements
We thank the experimental platforms at Westlake University, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, we also appreciate the help from all the members of Duanqing Pei’s lab, also in Jing Liu’s lab and Jiekai Chen’s lab. This work was supported by the National Natural Science Foundation of China (92068201, 32200662), the Key R&D Program of Zhejiang (2024SSYS0033, 2024SSYS0034), Yangtze River Delta Sci-Tech Innovation Community Joint Research Project (2022CSJGG1000),
Author information
Authors and Affiliations
Contributions
J.M. designed the project and performed the main experiments; L.L., C.Z., D.L., and C.L. performed the bioinformatic analysis, and X.H. provided basic bioinformatic counsel; L.W., S.F. and Y.H. performed the cell culture experiments; J.L. performed the blastocyst injection experiments, qPCR experiments, and ATAC-sequence experiments; Y.Y. performed the blastocyst injection experiments; T.H. and W.Z. performed the RNA-seq experiment; Y.F. constructed the plasmids; L.G. performed and X.Z. analyzed the IP-MS experiments; J.K. and Y.Q. provided the experimental counsel. J.C. and J.L. established the basic reprogramming protocol; B.W. established the reprogramming system; D.P. supervised and conceived the whole study, wrote the manuscript, and approved the final version. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ming, J., Lin, L., Li, J. et al. Cell fate decision by a morphogen-transcription factor-chromatin modifier axis. Nat Commun 15, 6365 (2024). https://doi.org/10.1038/s41467-024-50144-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-50144-z
- Springer Nature Limited