Cellular reprogramming demonstrates the remarkable plasticity of cell fates, illustrated by the isolation of induced pluripotent stem cells (iPSCs) from fibroblasts6-9
. Molecular analysis of epigenetic modifications has revealed a near complete remodeling of the epigenome during reprogramming1-4,12
, resulting in the conversion of lineage-specific to uniform protein-coding and miRNA gene expression profiles similar to embryonic stem cells (ESCs)2,6-9
. We and others have recently discovered a novel class of large intergenic non-coding RNAs (lincRNAs) that are expressed in a cell type-specific manner13
and can associate with epigenetic regulators11,14-16
involved in pluripotency and lineage commitment17,18
To date, it is not known whether large-scale transcriptional changes induced by reprogramming apply to lincRNAs, and if these changes have any functional relevance. To test this, we compared the transcriptional profiles of human lincRNAs alongside protein-coding genes across fibroblasts, their derivative iPSCs, and ESCs. We reprogrammed four primary fibroblast lines7
and validated the functionality of resulting iPSC lines (Supplementary Figure 1, 2
, and data not shown). We then performed DNA microarray analysis of the parental fibroblasts, seven of their derivative iPSC lines, and two ESC lines. Consistent with previous studies, analysis of gene expression profiles revealed that all iPSCs were similar to ESCs19,20
, and distinct from fibroblasts (, Supplementary Figure 3
). We detected 3694 and 3283 genes up- and downregulated, respectively, in iPSCs and ESCs compared with fibroblasts (>2fold, P<0.05; ). Taken together, our fibroblast-derived iPSCs fulfill functional criteria of bona fide
and exhibit a uniform protein-coding gene expression profile similar to ESCs.
Figure 1 Direct reprogramming of fibroblasts converts both protein-coding gene and lincRNA expression to a puripotent cell-specific profile. A, C) Unsupervised hierarchical clustering of protein-coding gene expression (A) and lincRNA expression (C) segregates (more ...)
To explore the expression of lincRNAs, we designed a microarray probing ~900 lincRNAs in the human genome11
and analyzed their expression in the above cell lines. The global lincRNA expression profiles of iPSCs were very similar to ESCs and distinct from fibroblasts (). We observed 133 and 104 lincRNAs that were induced or repressed (>2fold, FWER<0.05) across all iPSCs and ESCs compared with fibroblasts (, Supplementary Table 1
). Similar to protein-coding genes, direct reprogramming resulted in concomitant activation or repression of numerous lincRNAs consistent with a reactivation of the ESC state.
To exclude the possibility that reprogramming-induced changes in lincRNA expression reflects the opening and closing of chromatin domains of neighboring protein-coding genes, we analyzed the correlation of expression between each reprogrammed lincRNA and their neighboring genes, and found no significant correlation (P=0.999, ). This indicates an independent and cell type-specific regulation of lincRNA expression.
We sought to identify lincRNAs with potentially important functions in ESCs/iPSCs. Among the many pluripotency-associated lincRNAs ( and Supplementary Figure 4
), we searched for those that were expressed in both ESCs and iPSCs, but showed elevated levels in iPSCs relative to ESCs, reasoning that their high expression may have conferred a selective advantage on emerging iPSCs. We identified 28 lincRNAs that showed greater expression in fibroblast-iPSCs relative to ESCs (>2fold, FWER<0.05; ), and refer to these as “iPSC-enriched” lincRNAs hereafter.
Figure 2 Several lincRNAs show enriched expression in iPSCs compared with ESCs. A) Heatmap of 28 and 52 lincRNAs that are more highly expressed in fibroblast-derived iPSCs (left) and CD34+-derived iPSCs (right), respectively, compared with ESCs (>2fold, (more ...)
We hypothesized that if iPSC-enriched lincRNAs are important for reprogramming, they should be elevated in iPSCs independent of the cell-of-origin. To test this, we profiled lincRNA expression in CD34+
hematopoietic stem/progenitor cells, two CD34+
and ESCs, using the same approach as above. Like fibroblast-iPSCs, CD34+
-iPSCs had similar global lincRNA expression profiles as ESCs, distinct from that of CD34+
cells (Supplementary Figure 4
). 10 of the 28 lincRNAs elevated in fibroblast-iPSCs were also elevated in CD34+
-iPSCs (, Supplementary Figure 5
). This overlap was statistically significant (P<0.0001). We independently validated levels of 8/10 common iPSC-enriched lincRNAs by qRT-PCR (), and detected considerable variation in expression. Positive selection for minimal RNA levels and the absence of counter-selection against higher expression during reprogramming may be the cause for this variability. Collectively, these results demonstrate that numerous lincRNAs are tightly associated with the pluripotent state, including a subset that is consistently enriched in iPSCs independent of the cell-of-origin.
If iPSC-enriched lincRNAs are important for iPSC derivation, we suspected a link with the pluripotency network. To test this, we first intersected published OCT4 binding regions in ESCs22
with iPSC-enriched lincRNA loci (demarcated by domains of histone H3K4 and H3K36 methylation10,11
, named according to their neighboring 3’ gene), and identified three overlapping loci: lincRNA-SFMBT2, lincRNA-VLDLR, and lincRNA-ST8SIA3. We performed independent ChIP-qPCR to validate the binding of OCT4, and probe for SOX2 and NANOG occupancy at these sites. All three transcription factors occupied these regions, coinciding with or in close proximity to lincRNA promoters (peaks of H3K4me10
; , Supplementary Figure 6
Figure 3 Transcriptional regulation of iPSC-enriched lincRNAs. A) iPSC-enriched lincRNA loci are bound by pluripotency transcription factors. Top: lincRNA loci demarcated by domains enriched in histone H3K4me3 indicating RNA polymerase II promoters and H3K36me3 (more ...)
To determine whether expression of iPSC-enriched lincRNAs is dependent on pluripotency transcription factors, we depleted OCT4 in iPSCs and ESCs using siRNAs and monitored the levels of iPSC-enriched lincRNAs. We verified OCT4 knock-down and induction of the differentiation marker LMNA
(, Supplementary Figure 7
). Levels of all three iPSC-enriched lincRNAs fell within 72h (, Supplementary Figure 7C
). To further verify that downregulation of iPSC-enriched lincRNAs is caused by perturbation of the pluripotency network, we induced embryoid body (EB) formation as a distinct pathway of differentiation. Again, levels of all three iPSC-enriched lincRNAs fell within two days (, Supplementary Figure 7D
). The expression of these lincRNAs thus appears controlled by pluripotency transcription factors in ESCs and iPSCs.
We turned to investigate the functional roles of iPSC-enriched lincRNAs in the reprogramming process. To this end, we generated shRNA-expressing lentiviruses targeting lincRNA-ST8SIA3 and lincRNA-SFMBT2, which showed the strongest response to EB differentiation and OCT4 knock-down, and validated each knock-down relative to a non-targeting control shRNA (, Supplementary Figure 8A
). To test the effect of lincRNA depletion on reprogramming, we infected dH1f fibroblasts7
with both the shRNA-expressing and the reprogramming viruses7,9
, and scored emerging iPSC colonies based on Tra-1-60 marker expression (day 21)20
. Interference with lincRNA-SFMBT2 did not affect iPSC colony formation (Supplementary Figure 8B, C
), suggesting that lincRNA-SFMBT2 is not essential, or alternatively, that its moderate reduction was insufficient to perturb reprogramming. In contrast, knock-down of lincRNA-ST8SIA3 resulted in a significant 2 to 8-fold decrease of iPSC colonies relative to the control, where as progenitor cells were unaffected (P<0.01; , Supplementary Table 2
). As expected, resulting iPSC colonies fulfilled additional criteria of fully reprogrammed cells (Supplementary Figure 9
). These results demonstrate a functional requirement of lincRNA-ST8SIA3 expression for iPSC derivation.
Figure 4 LincRNA-ST8SIA3 expression modulates reprogramming. A) qRT-PCR verifies lincRNA-ST8SIA3 knock-down with Linc-sh1 and Linc-sh2 in hFib2-iPS5 cells relative to a non-targeting shRNA control (n=2, error bar: +/-s.e.m). B) Quantification of Tra-1-60+ iPSC (more ...)
Several studies have established critical roles of cell proliferation and bypass of senescence during the early stages of reprogramming23-27
. We therefore examined if knockdown of lincRNA-ST8SIA3 compromised cell growth of fibroblasts or cells during this window, and failed to detect significant differences in cells infected with the lincRNA-ST8SIA3-targeting virus compared with the control (, Supplementary Figure 10
). In addition, the kinetics of reprogramming upon knock-down of lincRNA-ST8SIA3 was similar to the control (Supplementary Figure 11
). Collectively, these findings point to a specific inhibition of the reprogramming process, rather than a delay of iPSC formation upon loss of lincRNA-ST8SIA3.
Intrigued by this phenotype, we used 5’ and 3’ rapid amplification of cDNA ends to clone the full-length transcript of lincRNA-ST8SIA3 (), which recovered a 2.6kb long RNA comprised of four exons (, red). We did not detect any clones that were spliced to protein-coding genes nor intact open reading frames, and confirmed the presence of a single transcript of expected length by Northern Blotting (, Supplementary Figure 12
We next used a complementary gain-of-function approach to test whether elevated lincRNA-ST8SIA3 expression might enhance reprogramming. We infected dH1fs with empty pBabe-puro retrovirus, GFP-expressing, or lincRNA-ST8SIA3-expressing virus, selected transgenic cells, and documented 25 to 70-fold overexpression of lincRNA-ST8SIA3 relative to the levels in H9 ESCs (). We induced reprogramming in these stable cell lines and consistently observed a more than 2-fold increase in iPSC colony formation (day 28 +/-2 days) (P<0.001; ). This was not associated with significant changes in cell growth of fibroblasts or cells at early stages of reprogramming (, Supplementary Figure 10
). Thus, overexpression of lincRNA-ST8SIA3 positively affects the establishment of iPSCs during reprogramming (), in addition to possible functions in iPSC maintenance. Supporting the latter, transient knock-down of lincRNA-ST8SIA3 in ESCs and established iPSCs resulted in a growth deficiency linked with elevated apoptosis (Supplementary Figure 13
To gain insight into which cellular pathways are affected by lincRNA-ST8SIA3 knock-down, we performed microarray gene expression analysis. Consistent with its apoptotic phenotype, knock-down of lincRNA-ST8SIA3 led to upregulation of genes involved in the p53 response, the response to oxidative stress and DNA damage-inducing agents, as well as cell death pathways (Supplementary Table 3
). Interestingly, simultaneous knock-down of p53 partially rescued the apoptotic phenotype caused by ablation of lincRNA-ST8SIA3 (Supplementary Figure 14
). Taken together, these results suggest that lincRNA-ST8SIA3 plays a role in promoting survival in iPSCs and ESCs, likely by preventing the activation of cellular stress pathways including the p53 response.
Our transcriptional profiling approach has revealed numerous lincRNAs that are part of the transcriptional repertoire of human ESCs and are induced during reprogramming of different cell types. We have identified several iPSC-enriched lincRNAs that appear to be directly regulated by the pluripotency network. Interestingly, we found no direct syntenic correlates of the 10 iPSC-enriched lincRNAs expressed in mouse ESCs (with the exception of lincRNA-VLDLR). Similar to what has been described for protein-coding genes28
, the transcriptional networks of lincRNAs in ESCs may have become rewired, conferring species-specific regulation.
The modulation of reprogramming by lincRNA-ST8SIA3 provides the first functional example of a lincRNA in establishing iPSCs, we therefore name it lincRNA-RoR for R
eprogramming. Future studies will be required to decipher the molecular mechanism by which lincRNA-RoR acts, and to gain a global understanding of lincRNA function in the establishment and maintenance of pluripotency. One possibility is that pluripotency-associated lincRNAs interface with chromatin modifying complexes to assist in the regulation of the distinct epigenetic architecture in pluripotent cells. Supporting this, previous studies have demonstrated critical roles for chromatin-modifying complexes in the establishment and maintenance of pluripotency, and numerous lincRNAs can interact with these complexes to impart target specificity11,15,16
. Here we demonstrate the modulation of reprogramming by a large non-coding RNA, supporting the notion that lincRNAs represent an additional layer of complexity in the networks controlling cellular identity.