|Home | About | Journals | Submit | Contact Us | Français|
MicroRNAs (miRNAs) are crucial for normal embryonic stem (ES) cell self-renewal and cellular differentiation, but how miRNA gene expression is controlled by the key transcriptional regulators of ES cells has not been established. We describe here a new map of the transcriptional regulatory circuitry of ES cells that incorporates both protein-coding and miRNA genes, and which is based on high-resolution ChIP-seq data, systematic identification of miRNA promoters, and quantitative sequencing of short transcripts in multiple cell types. We find that the key ES cell transcription factors are associated with promoters for most miRNAs that are preferentially expressed in ES cells and with promoters for a set of silent miRNA genes. This silent set of miRNA genes is co-occupied by Polycomb Group proteins in ES cells and expressed in a tissue-specific fashion in differentiated cells. These data reveal how key ES cell transcription factors promote the miRNA expression program that contributes to normal self-renewal and cellular differentiation, and integrate miRNAs and their targets into an expanded model of the regulatory circuitry controlling ES cell identity.
MicroRNAs (miRNAs) are crucial for normal embryonic stem (ES) cell self-renewal and cellular differentiation. Marson et al. describe how key ES cell transcription factors promote the miRNA expression program necessary for ES cell state using high-resolution ChIP-seq data, systematic identification of miRNA promoters, and quantitative sequencing of short transcripts in multiple cell types. They use this data to generate a model of ES cell regulatory circuitry that includes both protein-coding and miRNA genes. This information should prove useful as investigators continue to probe the role of miRNAs in pluripotency, cell state, disease and regenerative medicine.
Embryonic stem (ES) cells hold significant potential for clinical therapies because of their distinctive capacity to both self-renew and differentiate into a wide range of specialized cell types. Understanding the transcriptional regulatory circuitry of ES cells and early cellular differentiation is fundamental to understanding human development and realizing the therapeutic potential of these cells. Transcription factors that control ES cell pluripotency and self-renewal have been identified (Chambers and Smith, 2004; Niwa, 2007) and a draft of the core regulatory circuitry by which these factors exert their regulatory effects on protein-coding genes has been described (Boyer et al., 2005; Loh et al., 2006; Lee et al., 2006; Boyer et al. 2006; Jiang et al., 2008; Cole et al., 2008; Kim et al., 2008; Tam et al., 2008). MicroRNAs (miRNAs) are also likely to play key roles in ES cell gene regulation (Kanellopoulou et al., 2005; Murchison et al., 2005; Wang et al., 2007), but little is known about how miRNAs participate in the core regulatory circuitry controlling self-renewal and pluripotency in ES cells.
Several lines of evidence indicate that miRNAs contribute to the control of early development. miRNAs appear to regulate the expression of a significant percentage of all genes in a wide array of mammalian cell types (Lewis et al., 2005; Lim et al., 2005; Krek et al., 2005; Farh et al., 2005). A subset of miRNAs is preferentially expressed in ES cells or embryonic tissue (Houbaviy et al., 2003; Suh et al., 2004; Houbaviy et al., 2005; Mineno et al., 2006). Dicer-deficient mice fail to develop (Bernstein et al., 2003) and ES cells deficient in miRNA processing enzymes show defects in differentiation and proliferation (Kanellopoulou et al., 2005; Murchison et al., 2005; Wang et al., 2007). Specific miRNAs have been shown to participate in mammalian cellular differentiation and embryonic development (Stefani and Slack, 2008). However, how transcription factors and miRNAs function together in the regulatory circuitry that controls early development has not yet been examined.
The major limitation in connecting miRNA genes to the core transcriptional circuitry of ES cells has been sparse annotation of miRNA gene transcriptional start sites and promoter regions. Mature miRNAs, which specify post-transcriptional gene repression, arise from larger transcripts that are then processed (Bartel, 2004). Over 400 mature miRNAs have been confidently identified in the human genome (Landgraf et al., 2007), but only a minority of the primary transcripts have been identified and annotated. Prior attempts to connect ES cell transcriptional regulators to miRNA genes have searched for transcription factor binding sites only close to the annotated mature miRNA sequences (Boyer et al., 2005; Loh et al., 2006; Lee et al., 2006). Additionally, studies of the core transcriptional circuitry of ES cells have compared transcription factor occupancy to mRNA expression data, but have not examined systematically miRNA expression in ES cells and differentiated cell types, limiting our knowledge of transcriptional regulation of miRNA genes in these cells (Boyer et al., 2005; Loh et al., 2006; Lee et al., 2006; Boyer et al. 2006; Jiang et al., 2008; Cole et al., 2008; Kim et al., 2008; Tam et al., 2008).
To incorporate miRNA gene regulation into the model of transcriptional regulatory circuitry of ES cells, we began by generating new, high-resolution, genome-wide maps of binding sites for key ES cell transcription factors using massive parallel sequencing of chromatin immunoprecipitation (ChIP-seq). These data reveal highly overlapping occupancy of Oct4, Sox2, Nanog and Tcf3 at the transcriptional start sites of miRNA transcripts, which we systematically mapped based on a method that uses chromatin landmarks and transcript data. We then carried out quantitative sequencing of short transcripts in ES cells, neural precursor cells (NPCs) and mouse embryonic fibroblasts (MEFs), which revealed that Oct4, Sox2, Nanog and Tcf3 occupy the promoters of most miRNAs that are preferentially or uniquely expressed in ES cells. Our data also revealed that a subset of the Oct4/Sox2/Nanog/Tcf3 occupied miRNA genes are silenced in ES cells by Polycomb Group proteins, but are expressed later in development in specific lineages. High-resolution transcription factor location analysis, systematic mapping of the primary miRNA transcriptional start sites in mouse and human, and quantitative sequencing of miRNAs in three different cell types provide a valuable data resource for studies of the gene expression program in ES and other cells and the regulatory mechanisms that control cell fate. The data also produce an expanded model of ES cell core transcriptional regulatory circuitry that now incorporates transcriptional regulation of miRNAs, and post-transcriptional regulation mediated by miRNAs, into the molecular understanding of pluripotency and early cellular differentiation.
To connect miRNA genes to the core transcriptional circuitry of ES cells, we first generated high-resolution genome-wide maps of Oct4, Sox2, Nanog, and Tcf3 occupancy (Figure 1; similar data was recently described for Oct4, Sox2, and Nanog (Chen et al,)). ChIP-seq allowed us to map transcription factor binding sites and histone modifications across the entire genome at high resolution (Barski et al., 2007; Johnson et al., 2007; Mikkelsen et al., 2007; Robertson et al., 2007), and we optimized the protocol to allow for robust analysis of transcription factor binding in murine ES cells (Supplementary Text). Oct4, Sox2, Nanog and Tcf3 were found to co-occupy 14,230 sites in the genome (Figures (Figures1A,1A, S1, S2 and Table S1–Table S3). Approximately one quarter of these occurred within 8kb of the transcription start site of 3,289 annotated genes, another one quarter occurred within genes but more than 8kb from the start site, and almost half occurred in intergenic regions distal from annotated start sites (Supplementary Text). Binding of the four factors at sites surrounding the Sox2 gene (Figure 1B) exemplified two key features of the data: all four transcription factors co-occupied the identified binding sites and the resolution was sufficient to determine the DNA sequence associated with these binding events to a resolution of <25bp. Composite analysis of all bound regions provided higher resolution and suggested how these factors occupy their common DNA-sequence motif (Figure S4, Table S4). Knowledge of these binding sites provided data necessary to map these key transcription factors to the promoters of miRNA genes.
Imperfect knowledge of the start sites of primary miRNA transcripts has limited our ability to identify the transcription factor binding events that control miRNA gene expression in vertebrates. Previous strategies to identify the 5’ ends of primary miRNAs have been hampered because they relied on isolation of transient primary miRNA transcript, required knowledge of the specific cell type in which each given miRNA is transcribed, or focused only on potential start sites proximal to mature miRNAs (Fukao et al., 2007; Mikkelsen et al., 2007; Zhou et al., 2007; Barrera et al., 2008). To identify systematically transcriptional start sites for miRNA genes in the mouse and human genomes, we took advantage of the recent observation that histone H3 is tri-methylated at its lysine 4 residue (H3K4me3) at the transcriptional start sites of most genes in the genome, even when genes are not productively transcribed, and knowledge that this covalent modification is restricted to sites of transcription initiation (Barski et al., 2007; Guenther et al., 2007). We used the genomic coordinates of the H3K4me3 enriched loci derived from multiple cell types (Table S5, Barski et al., 2007; Guenther et al., 2007; Mikkelsen et al., 2007) to create a library of candidate transcription start sites in both human and mouse (Figures (Figures22 and S5).
High-confidence promoters were identified for over 80% of miRNAs in both mouse and human (Figures (Figures2,2, S5 and Table S6 and Table S7). These promoters were associated with 185 murine primary microRNA transcripts (pri-miRNAs) (specifying 336 mature miRNAs), and 294 human pri-miRNAs (specifying 441 mature miRNAs) (Table S6 and Table S7). To identify promoters for miRNA genes, the association of candidate transcriptional start sites with regions encoding mature miRNAs was scored based on proximity to annotated mature miRNA sequences (Landgraf et al., 2007), available EST data, and conservation between species (Figures (Figures2A,2A, S5 and S6 and Supplementary Text).
Five lines of evidence indicated that this approach identified genuine transcriptional start sites for miRNA genes. Existing EST data provided evidence that the predicted transcripts do in fact originate at the identified start sites and continue through the annotated loci of mature miRNAs (Figures (Figures2B,2B, S5 and S6). In addition to the chromatin signature of promoters, a high fraction of these regions contained CpG islands, a DNA sequence element often associated with promoters (Figure 2B and Table S6 and Table S7). Third, in some instances where evidence of primary miRNA transcripts, which may be present only transiently before processing, was not available in published databases at the identified transcriptional start sites, chromatin marks associated with transcriptional elongation including nucleosomes methylated at H3 lysine 36 (H3K36me3) and H3 lysine 79 (H3K79me2), provided evidence that such transcripts are actively produced (Figure 2C and Mikkelsen et al., 2007). Fourth, most miRNA promoters showed evidence of H3K4me3 enrichment in multiple tissues, as observed at the promoters of most protein-coding genes (Barski et al., 2007; Guenther et al., 2007; Heintzman et al., 2007) (Figure 2D). Finally, there was a high degree of correlation (8/10) between the identified miRNA transcriptional start sites and those that have mapped previously by other methods (Supplementary Text).
The binding sites of the ES cell transcription factors Oct4, Sox2, Nanog and Tcf3 were next mapped to these high-confidence miRNA promoters (Figure 3). In murine ES cells, Oct4, Sox2, Nanog, and Tcf3 co-occupied the promoters for 55 distinct miRNA transcription units, which included three clusters of miRNAs that are expressed as large polycistrons, thus suggesting that these regulators have the potential to directly control the transcription of 81 distinct mature miRNAs (Figure 3A and Table S6). This set of miRNAs occupied by Oct4/Sox2/Nanog/Tcf3 represented roughly 20 percent of annotated mammalian miRNAs, similar to the ~20 percent of protein-coding genes that were bound at their promoters by these key transcription factors (Table S2).
To determine if transcription factor occupancy of miRNA promoters is conserved across species, we performed genome-wide location analysis for Oct4 in human ES cells using microarray-based analysis. We found extensive conservation of the set of miRNA genes that were occupied at their promoters by Oct4, as exemplified by the mir-302 cluster (Figures 3A, 3B and Table S7 and Table S8). Transcription factor occupancy does not mean necessarily that the adjacent gene is regulated by that factor; conserved transcription factor occupancy of a promoter, however, suggests gene regulation by that factor. Thus, our data identified a set of miRNA genes that are bound at their promoters by key ES cell transcription factors in mouse and human cells (Figure 3C), suggesting that core ES cell transcription factor occupancy of these particular miRNA genes has functional significance.
The dependence of Oct4/Sox2/Nanog/Tcf3-bound miRNA genes on the core ES cell transcription factors was assessed by examining changes in miRNA expression following perturbation of individual transcription factors. First, we studied the effects of Oct4 depletion in ZHBTc4 ES cells, which allow for conditional repression of Oct4 with doxycycline treatment (Niwa et al., Nature Genetics 2000). miRNA expression was examined at 12 and 24 hours after doxycycline treatment, when Oct4 was silenced but the cells remained ES-like morphologically and still expressed Sox2 (Figure S7). Effects on miRNA regulation were measured globally using quantitative sequencing of short RNAs (18–30 nucleotides) in these cells (Morin et al., 2008). Although mature miRNAs have exceptionally long half-lives, small reductions in the levels of the majority of Oct4/Sox2/Nanog/Tcf3-bound miRNAs were observed (Figure S7E). To measure more directly the transcriptional activity of Oct4/Sox2/Nanog/Tcf3-occupied miRNA genes, the levels of primary miRNA transcripts were assessed by quantitative PCR (Figure 3D and S7). All five of the primary miRNAs examined were reduced significantly upon Oct4 depletion (Figure 3D).
Tcf3 has been shown to repress the expression of key pluripotency protein-coding genes in ES cells under standard culture conditions (Cole et al., 2008; Tam et al., 2008; Yi et al., 2008). Following shRNA-mediated knockdown of Tcf3 in ES cells, qPCR revealed that levels of primary transcripts for ES cell miRNA genes mir-290-295 and the mir-302 cluster were elevated, though this effect was modest (Figure S8). In summary, the observation that these key ES cell miRNAs are generally down-regulated in response to Oct4 depletion and up-regulated upon Tcf3 depletion, as would be predicted based on published effects of these factors on protein-coding genes, demonstrates that the core transcriptional circuitry of ES cells can play a functional role in the regulation of miRNA genes.
Oct4 and Nanog are silenced as ES cells begin to differentiate (Chambers and Smith, 2004; Niwa, 2007). If the Oct4/Sox2/Nanog/Tcf3 complex is required for activation or repression of its target miRNAs, the targets should be differentially expressed when ES cells are compared to a differentiated cell-type. To test this hypothesis, Solexa sequencing of 18–30 nucleotide transcripts in ES cells, mouse embryonic fibroblasts (MEFs), and neural precursors (NPCs), was performed to obtain quantitative information on the abundance of miRNAs in pluripotent cells relative to two differentiated cell types (Figure 4A and Table S1). In each cell type examined, a small subset of miRNAs predominated, with pronounced changes in miRNA abundance observed among the cell types (Supplementary Text).
When the abundance of all miRNAs bound by Oct4/Sox2/Nanog/Tcf3 in ES cells was examined in the three cell types, approximately half of the miRNAs dropped more than an order of magnitude in abundance in MEFs and NPCs relative to ES cells, as predicted (Figure 4B). A small number of the Oct4/Sox2/Nanog/Tcf3-occupied miRNAs, which will be further discussed below, were scarce in ES cells and showed increased abundance in MEFs and NPCs.
Oct4/Sox2/Nanog/Tcf3-occupied miRNAs were, in general, preferentially expressed in embryonic stem cells, as demonstrated by the analysis shown in Figure 4C. Whereas most miRNAs are unchanged in expression in ES cells relative to MEFs or NPCs, a significant portion of Oct4/Sox2/Nanog/Tcf3 occupied miRNAs are 100 fold more abundant in ES cells than in MEFs (p<5 × 10−15), and 1,000 fold more abundant in ES cells than in NPCs (p<5 × 10−9). This group of Oct4/Sox2/Nanog/Tcf3 bound miRNAs that was significantly more abundant in ES cells than in NPCs and MEFs, was also found to be actively expressed in induced pluripotent stem (iPS) cells (generated as described in Wernig et al., 2007), at levels comparable to those in ES cells (Figure S9). This is consistent with our evidence (Figures (Figures3,3, S7 and S8) that core ES cell transcription factors regulate the expression of these miRNAs in pluripotent cells.
Previous studies have revealed that core ES cell transcription factors occupy a set of transcriptionally active genes, but also occupy, with Polycomb Group proteins, a set of transcriptionally repressed genes that are poised for expression upon cellular differentiation (Lee et al., 2006; Bernstein et al., 2006; Boyer et al., 2006). As noted above, Oct4/Sox2/Nanog/Tcf3 similarly occupied a group of miRNA genes that were transcriptionally inactive in ES cells, but were activated selectively in particular differentiated cell types (Figure 4B). We reasoned that Polycomb complexes might also co-occupy Oct4/Sox2/Nanog/Tcf3-occupied miRNA genes that are inactive in ES cells. Indeed, new ChIP-seq data for the Polycomb Group protein Suz12 in murine ES cells supported this hypothesis (Figure 5A and Table S6, Table S7, Table S10). As expected, these promoters were also enriched for nucleosomes with histone H3K27me3, a chromatin modification catalyzed by Polycomb Group proteins (Figure 5A and Table S6 and Mikkelsen et al., 2007). In keeping with the repressive function of the Polycomb Group proteins reported at protein coding genes, miRNAs occupied at their promoters by Suz12 in ES cells were significantly less abundant in ES cells compared to all other miRNAs (Figure 5B). Approximately one quarter of the Oct4/Sox2/Nanog/Tcf3-occupied miRNAs belonged to the repressed set of miRNA genes bound by Suz12 in murine ES cells (Table S6 and Table S7).
To further examine the behavior of this set of miRNAs during embryonic cell-fate commitment, we returned to our quantitative sequencing data of short transcripts in ES cells, MEFs and NPCs (Figure 5C). Notably, miRNAs that were bound by Polycomb Group proteins in ES cells were among the transcripts that were specifically induced in each of the differentiated cell types. For example, transcript levels of miR-9, a miRNA previously identified in neural cells and which promotes neural differentiation (Lagos-Quintana et al., 2002; Krichevsky et al., 2006), were significantly elevated in NPCs relative to ES cells, but this miRNA remained scarce in MEFs. Similarly, miR-218 and miR-34b/34c expression was induced in MEFs, but remained at low levels in NPCs (Figure 5C). Consistent with Polycomb-mediated repression of these lineage-specific miRNAs, the repressive chromatin mark deposited by Polycomb Group proteins, H3K27me3, was selectively lost at the promoters of the miRNAs in the cells in which they were induced (Figure 5C and Mikkelsen et al., 2007).
The tissue-specific expression pattern of miRNAs repressed by Polycomb in ES cells is consistent with these miRNAs serving as determinants of cell-fate decisions in a manner analogous to the developmental regulators whose genes are repressed by Polycomb in ES cells (Lee et al., 2006; Bernstein et al., 2006; Boyer et al., 2006). Such a function in cell-fate determination would require that these miRNAs remain silenced in pluripotent ES cells. Indeed, the miRNAs that were repressed in ES cells by Polycomb Group proteins appear to be induced, later in development, in a highly restricted subset of differentiated tissues specific to each miRNA (Figure S10), unlike the majority of miRNAs identified in mouse (Landgraf et al., 2007). The miRNAs with promoters bound by Polycomb Group proteins in ES cells were significantly enriched (p<0.005) among the set of the most tissue-specific mammalian miRNAs (Figure S10 and Landgraf et al., 2007). This suggests a model whereby Polycomb Group proteins repress a set of tissue-specific miRNA genes in ES cells, a subset of which are co-occupied by Oct4, Sox2, Nanog and Tcf3 (Figure 5D).
Here we provide new high-resolution, genome-wide maps of core ES cell transcription factors, identify promoter regions for most miRNA genes, and deduce the association of the ES cell transcription factors with these miRNA genes. We also provide quantitative sequence data of short RNAs in ES cells, NPCs and MEFs to examine changes in miRNA transcription. The key transcriptional regulators in ES cells collectively occupied the promoters of many of the miRNAs that were most abundant in ES cells, including those that were down-regulated as ES cells differentiate. In addition, these factors also occupied the promoters of a second, smaller set of miRNAs that were repressed in ES cells and were selectively expressed in specific differentiated cell types. In ES cells, this second group of miRNAs were co-occupied by Polycomb group proteins, which are also known to silence key lineage-specific, protein-coding developmental regulators. Together these data reveal two key groups of miRNAs that are direct targets of Oct4/Sox2/Nanog/Tcf3: one group of miRNAs that is preferentially expressed in pluripotent cells, and a second, Polycomb-occupied group that is silenced in ES cells and is poised to contribute to cell fate-decisions during mammalian development.
Several miRNA polycistrons, which encode the most abundant miRNAs in ES cells and which are silenced during early cellular differentiation (Houbaviy et al., 2003; Suh et al., 2004; Houbaviy et al., 2005), were occupied at their promoters by Oct4, Sox2, Nanog and Tcf3. The most abundant in murine ES cells was the mir-290-295 cluster, which contains multiple mature miRNAs with seed sequences similar or identical to those of the miRNAs in the mir-302 cluster and the mir-17-92 cluster. miRNAs with the same seed sequence also predominate in human embryonic stem cells (Laurent et al., 2008). miRNAs in this family have been implicated in cell proliferation (O’Donnell et al., 2005; He et al., 2005; Voorhoeve et al., 2006), consistent with the impaired self-renewal phenotype observed in miRNA-deficient ES cells (Kanellopoulou et al., 2005; Murchison et al., 2005; Wang et al., 2007). Additionally, the zebrafish homologue of this miRNA family, mir-430, contributes to the rapid degradation of maternal transcripts in early zygotic development (Giraldez et al., 2006), and mRNA expression data suggests that this miRNA family also promotes the clearance of transcripts in early mammalian development (Farh et al., 2005).
In addition to promoting the rapid clearance of transcripts as cells transition from one state to another during development, miRNAs also likely contribute to the control of cell identity by fine-tuning the expression of genes. mir-430, the zebrafish homologue of the mammalian mir-290-295 family, serves to precisely tune the levels of Nodal antagonists Lefty1 and Lefty 2 relative to Nodal, a subtle modulation of protein levels that has pronounced effects on embryonic development (Choi et al., 2007). Recently, a list of ~250 murine ES cell mRNAs that appear to be under the control of miRNAs in the mir-290-295 cluster was reported (Sinkkonen et al., 2008). This study reports that Lefty1 and Lefty2 are evolutionarily conserved targets of the mir-290-295 miRNA family. These miRNAs also maintain the expression of de novo DNA methyltransferases 3a and 3b (Dnmt3a and Dnmt3b), perhaps by dampening the expression of the transcriptional repressor Rbl2, helping to poise ES cells for efficient methylation of Oct4 and other pluripotency genes during differentiation.
Knowledge of how the core transcriptional circuitry of ES cells connects to both miRNAs and protein-coding genes reveals recognizable network motifs downstream of Oct4/Sox2/Nanog/Tcf3, involving both transcriptional and post-transcriptional regulation, that provide new insights into how this circuitry controls ES cell identity (Figure 6). Lefty1 and Lefty2, both actively expressed in ES cells, are directly occupied at their promoters by Oct4/Sox2/Nanog/Tcf3. mir-290-295, which is also directly occupied by Oct4/Sox2/Nanog/Tcf3, depends on Oct4 for proper expression (Figure 3D). Therefore, core ES cell transcription factors appear to promote the active expression of Lefty1 and Lefty2, but also fine-tune the expression of these important signaling proteins by activating a family of miRNAs that target the Lefty1 and Lefty2 3’UTRs. This network motif whereby a regulator exerts both positive and negative effects on its target, termed “incoherent feed-forward” regulation (Alon, 2007), provides a mechanism to fine-tune the steady-state level or kinetics of a target’s activation (Figure 6A). Over a quarter of the proposed targets of the mir-290-295 miRNAs (Sinkkonen et al., 2008) are likely under the direct transcriptional control of Oct4/Sox2/Nanog/Tcf3 based on our binding maps, suggesting that these miRNAs could participate broadly in tuning the effects of ES cell transcription factors (Figure 6A and Supplementary Text).
The miRNA expression program directly downstream of Oct4/Sox2/Nanog/Tcf3 could help poise ES cells for rapid and efficient differentiation, consistent with the phenotype of miRNA-deficient cells (Kanellopoulou et al., 2005; Murchison et al., 2005; Wang et al., 2007). Oct4/Sox2/Nanog/Tcf3 likely contributes to this poising by their occupancy of the Let-7g promoter. Mature Let-7 transcripts are scarce in ES cells, but were among the most abundant miRNAs in both MEFs and NPCs (Figure 3). Primary Let-7g transcript is abundant in ES cells, but its maturation is blocked by Lin28 (Viswanathan et al., 2008). We now report that the promoters of both Let-7g and Lin28 are occupied by Oct4/Sox2/Nanog/Tcf3, suggesting that the core ES cell transcription factors promote the transcription of both primary Let-7g and Lin28, which blocks the maturation of Let-7g. Indeed, proper expression of pri-Let-7g is dependent on Oct4 (Figure 3D). In this way Let-7 and Lin-28 participate in an incoherent feed-forward circuit downstream of Oct4/Sox2/Nanog/Tcf3 to contribute to rapid cellular differentiation (Figure 6B). Notably, ectopic expression of Lin28 in human fibroblasts promotes the induction of pluripotency (Yu et al., 2007), suggesting blocked maturation of pri-Let-7 transcripts plays an important role in the pluripotent state. Additionally, Dnmt3a and Dnmt3b, which are indirectly up-regulated by the mir-290-295 miRNAs (Sinkkonen et al., 2008), are also occupied at their promoters by Oct4/Sox2/Nanog/Tcf3, providing examples of “coherent” regulation of important target genes by ES cell transcription factors and the ES cell miRNAs maintained by those transcription factors (Figure 6C).
The regulatory circuitry we present for miRNAs in ES cells can now be integrated into the model of core regulatory circuitry of pluripotency we have proposed previously (Boyer et al., 2005; Lee et al., 2006; Cole et al., 2008), as illustrated in Figure 7. Our data reveal that Oct4, Sox2, Nanog and Tcf3 occupy the promoters of two key sets of miRNAs, similar to the two sets of protein-coding genes regulated by these factors: one set that is actively expressed in pluripotent ES cells and another that is silenced in these cells by Polycomb Group proteins and whose later expression might serve to facilitate establishment or maintenance of differentiated cell states.
The expanded circuit diagram presented here integrates transcription factor occupancy of miRNA genes and existing data on miRNA targets into our model of the molecular control of the pluripotent state. These data suggest that miRNAs that are activated in ES cells by Oct4/Sox2/Nanog/Tcf3, serve to modulate the direct effects of these transcription factors, participating in incoherent feed-forward regulation to tune levels of key genes, and modifying the gene expression program to help poise ES cells for efficient differentiation. Thus, the core ES cell transcription factors and the miRNAs under their control coordinately contribute transcriptional and post-transcriptional gene regulation to the network that maintains ES cell identity.
Knowledge of how protein-coding genes are controlled by key ES cell transcription factors and chromatin regulators has provided important insights into the molecular control of ES cell identity and cellular reprogramming (Jaenisch and Young, 2008). This knowledge also has begun to shed light on human disease, as elements of the ES cell gene expression program are recapitulated in cancer cells (Wong et al., 2008; Ben-Porath et al., 2008). We now connect miRNA genes to the core circuitry of ES cells with high-resolution genome-wide ChIP-seq data, and quantitative sequencing of short transcripts in multiple cell types. This information should prove useful as investigators continue to probe the role of miRNAs in pluripotency, cell state, disease and regenerative medicine.
A detailed description of all materials and methods used can be found in Supplementary Text.
V6.5 (C57BL/6-129) murine ES cells were grown under typical ES conditions (see Supplementary Text) on irradiated mouse embryonic fibroblasts (MEFs). For location analysis, cells were grown for one passage off of MEFs, on gelatinized tissue-culture plates. Neural precursor cells (NPCs) derived from V6.5 ES cells and mouse embryonic fibroblasts prepared and cultured from DR-4 strain mice were grown using standard protocols as previously described (See Supplementary Text). ZHBTc4 cells harboring a doxycycline-repressible Oct4 allele (Niwa et al., 2000), a gift from A. Smith, were cultured under standard ES cell conditions on gelatin. Cultures were treated with 2µg/ml doxycycline (SIGMA, D-9891) for 12hrs or 24hrs.
Detailed descriptions of antibodies, antibody specificity and ChIP methods used in this study have been published previously and are provided in Supplementary Text. Purified immunoprecipitated DNA was prepared for sequencing according to a modified version of the Solexa Genomic DNA protocol. Fragmented DNA was end repaired and subjected to 18 cycles of linker-mediated (LM)-PCR using oligos purchased from Illumina. Amplified fragments between 150 and 300bp were isolated by agarose gel electrophoresis and purified. High quality samples were confirmed by the appearance of a smooth smear of fragments from 100–1000bp with a peak distribution between 150 and 300bp. 3ng of linker-ligated DNA was applied to the flow-cell using the Solexa Cluster Station fluidics device. Samples were then subjected to 26 bases of sequencing according to Illumina’s standard protocols.
Images acquired from the Solexa sequencer were processed through the bundled Solexa image extraction pipeline and aligned to both mouse NCBI build 36 and 37 using ELAND. Analysis is described in Supplementary Text. All sequence data is available at the NCBI GEO database under the accession designation GSE11724.
The design of the oligonucleotide-based whole genome array set and data extraction methods are described in Lee et al., 2006. The microarrays used for location analysis in this study were manufactured by Agilent Technologies (http://www.agilent.com). All array data is available at Array Express under the accession designation E-MEXP-1687.
A method of cloning the 18–30nt transcripts previously described (Lau et al., 2001) was modified to allow for Solexa (Illumina) sequencing (manuscript submitted). RNA extraction was performed using Trizol, followed by RNeasy purification (Qiagen) for mES cells, MEFs and NPCs. Single-stranded cDNA libraries of short transcripts were generated using size selected RNA from mES cells NPCs, MEFs and ZHBTc4 cells (See Supplementary Text). Single-stranded DNA samples were resuspended in 10mM Tris (EB buffer)/0.1% Tween and used as indicated in the standard Solexa sequencing protocol (Illumina). All sequence data is available at to the NCBI GEO database under the accession designation GSE11724.
Real time PCR primers were designed using the standard specifications of PrimerExpress (Applied Biosystems) to amplify regions within the ~200nt immediately upstream of the tested miRNA hairpins or in the middle of mir-290-295 polycistron, but outside of any miRNA hairpin regions (Supplementary Text and Figure S8B). Primers were used in SYBR Green quantitative PCR assays on the Appied Biosystems 7500 Real Time PCR system. Expression levels were calculated relative to Gapdh mRNA levels, which were quantified in parallel by Taqman analysis. Detailed methods and primer sequences can be found in Supplementary Information.
We thank members of the Young, Jaenisch and Bartel laboratories, especially T. Lee, for discussions and critical review of the manuscript. We also thank A. Ravi and A. Seila for their help and A. Smith for providing the ZHBT-c4 cells. We are grateful to L.A. Boyer, B. Chevalier, R. Kumar, and T. Lee who were instrumental in performing location analysis in hES cells. We also thank Biology and Research Computing (BaRC), especially Tom DiCesare, for computational and technical support and graphic assistance. This work was supported in part by NIH grants 5-RO1-HDO45022, 5-R37-CA084198, and 5-RO1- CA087869 to R.J. and by NIH grant HG002668 and a grant from the Whitehead Institute to R.A.Y, and partially by a Cancer Center Support (Core) grant P30-CA14051 from the NCI.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.