Characteristics of reprogrammed human cell lines
We first examined tumor xenografts from reprogrammed clones (
Supplementary Table 1a), for pluripotent potential and neoplastic features. The highest cancer potential was for clone MP4, a fibroblast line induced by lentiviral introduction of
OCT4, SOX2, NANOG, LIN28 and
SV40 T-antigen(OSNLT)(1, 17), and known to be only partially reprogrammed (
17). MP4 cells express undifferentiated markers such as OCT4, but not TRA-1–60, are refractory to differentiation induction
in vitro and
in vivo (i.e., nullipotent), and possess an abnormal karyotype. Mouse xenografts demonstrate high nuclear to cytoplasmic ratio, an extremely high mitotic rate, areas of necrosis, and the histology of a primitive, aggressive, mesenchymal tumor (). In comparison, clone MP2, prepared identically to MP4 and a typical iPS expressing TRA-1–60 and all classical iPS markers (
17), forms what appears to be a benign, multi-lineage teratoma (, upper right panel). However, this clone has an abnormal karyotype and upon closer examination, the teratoma contains small foci suggesting malignancy, including regions where cells infiltrate host skeletal muscle bundles (, lower right panel).
We then performed blinded comparison (BS and DMB) of teratomas ( and
Supplementary table 1b) from the conventional embryonic stem cell ( ESC) lines, H1, H9, and SC233 versus from multiple additional pluripotent iPS with normal karyotypes and metrics of fully reprogrammed human iPS (
23), including expression of the embryonic markers, AP, OCT4, and TRA-1–60. These include, MR45 and MR46, generated from fibroblasts by retrovirally introducing
OCT4, SOX2, KLF4 and c-MYC (OSKM)(2), MB41, MB45, MMW1, and MMW2 generated with the same retroviral vectors but from mesenchymal stem cells (MSCs), and two iPS lines (MR31 and MR32) retrovirally derived from IMR90 fibroblasts using 3 factors (OSK, without c-MYC). All formed mouse xenografts with differentiated cell types of multiple embryonic germ layers (). However, the successfully reprogrammed iPS cell xenografts show a range of degree of maturation (from .067 to .231 lineage structures/ areas examined) of defined structures such as cartilage, bone, intestine, etc. (), that is lower than the range of values (0.338–0.97) for three ESC derived teratomas ( –
Supplemental Table 1b). Most importantly, all iPS teratomas examined have foci with malignant-like characteristics which include focal necrosis, nuclear pleomorphism, aberrantly high mitotic rates, and infiltration into the mouse musculature (Examples shown in ;
Supplemental table 1b). Such foci were generally not found in the three ESC derived teratomas (
Supplemental table 1b), save for one small area of focal necrosis in the teratoma derived from H1 ESC.
Overall Gene Expression profiles of reprogrammed clones
We compared, using full transcriptome Agilent arrays, global gene expression between representative reprogrammed clones versus ESC’s and found, as have others (
10), highly similar, but not identical, patterns. Even the partially reprogrammed MP4 clone deviated only slightly from the pluripotent clones and ESC and clustered separately from four human cancer cell lines (
Supplementary Fig. 1). However, focus upon genes highly expressed in ESC revealed distinct differences. Compared to parent cells, pluripotent clones MR46 and MR45 significantly up-regulated to levels in ESC, 12 of 16 and 13 of 16 such genes, respectively (
Supplementary Table 3), MP2 increased only 11 of 16 above levels in parent fibroblasts, and only 8 to levels in ESC. The malignant, partially reprogrammed, MP4 properly increased only 5 of the 16 genes with respect to starting fibroblasts and only 6 reached levels in ESC. Also, partially reprogrammed clone, MP4, expressed both OCT4 and c-MYC, due to incomplete repression of introduced transgenes, significantly higher than in ESC (
Supplementary Table 3).
Abnormal gene silencing, and gene responses to epigenetic modifying agents, associated with cellular reprogramming
We next linked, in reprogrammed clones, gene expression to epigenetic aspects of neoplasia, with respect to abnormal gene silencing events during reprogramming. In pluripotent clones, MR46, MR45, and MP2, normal silencing events (, left panel) dominate over abnormal silencing events for both genes that should be silent in fibroblasts but activated in ESC, or which should be active in both cell types ( middle and right panels) while the opposite is true for the tumorigenic, MP4 cells ( –
Supplementary Table 4). However, while MP4 has the highest number of abnormal silencing events (~800), even the best performing iPS clone, MR46, has ~ 100 such genes (, combined;
Supplementary Table 4). These numbers may be characteristic of iPS since analysis of the best reprogrammed mouse iPS clone (MCV8.1, subclone 8.1(
24) from a recent study of Mikkelsen et. al. (
10) revealed 418 abnormally silenced genes (
Supplementary Fig. 2).
In cancer, DNA hypermethylation of CpG island promoters (
16) is a prime candidate for mediating abnormal gene silencing and several abnormally silenced genes in reprogrammed clones are known to have such changes including
CDKN2B (Suppl Ref. 1), LXN (Suppl Ref. 2, 3), TIMP3 (Suppl Refs. 2, 4), and PYCARD (Suppl Ref. 5). We thus investigated global gene expression responses to both the DNA demethylation agent, DAC and the histone deactylase inhibitor, TSA. DAC effectively re-expresses genes in cancer with densely hypermethylated, promoter CpG islands while TSA, alone, does not (
18)(see
supplemental Fig. 3 for array expression responses). The results strikingly separate cancer cells, iPS cells, and especially the partially reprogrammed MP4 clone, from parent fibroblasts, adult MSC, and ESC. Addition of either DAC or TSA () reactivates between 67% (MR46) and 84% (MR45) of the abnormal silenced genes in . A dramatic finding is that, in partially reprogrammed clone MP4, and to a lesser degree other iPS (), more induced silent genes are responsive only to DAC alone, or both DAC and TSA (left to right, ), but not to TSA alone and this is also true, to a slightly less extent (), for abnormal retained silencing genes (those in far right panel). This is true for both CpG island and non-CpG island containing genes ( respectively). The pattern for MP4 cells begins to resemble that for the cancer cell lines, which have an extraordinary dominance of DAC alone versus TSA alone, responsive genes. Moreoever, genes in normal ESC, adult MSC, and committed bone progenitor cells (osteoblasts), have a starkly dominant response to TSA alone (). These patterns also hold when~ 400 CpG island containing genes responsive to DAC alone in the U2OS osteosarcoma cells () and HT1080 fibrosarcoma cells () serve as their own controls - ie, for these genes, TSA responses dominate in normal cells, DAC responses dominate in cancer cells, and the reprogrammed clones show a mixed response to DAC and TSA consisting of a predominantly DAC response most apparent in the MP4 clone. Finally normal committed IMR90 fibroblasts have more of a mixture of DAC and TSA responsive genes, but with responses still skewed towards TSA () and the increased frequency of DAC responsive genes even separates all of the fully reprogrammed clones from ESC and adult MSC ().
Taken together, the data indicate that re-programmed somatic cells can acquire both abnormal gene silencing events and aberrant responses to epigenetic modifying drugs that deviate from normal and are reminiscent of changes observed in cancer cells.
Cancer-related promoter DNA methylation changes arise, early, during reprogramming
To further examine abnormal gene silencing and cancer specific promoter, DNA methylation on a global scale, we used the Infinium platform(
25) which queries ~27,000 CpG sites, most, but not all, in annotated, promoter, CpG islands. When focused on analyzing CpG’s located from −1,000 to +200bp from gene transcription start, we find, as have others in genome – wide DNA methylation studies (
10,
26,
27), that the vast majority (~ 90%), of ~ 7500 loci in ~ 5700 different genes with well annotated, promoter, CpG islands, are unmethylated in all normal cells (ESC, MSC, fibroblasts -
Supplemental Fig 4). In contrast, for ~3,750 probes in ~ 800 autosomal genes not containing dense CpG island promoters, CpG poor promoters are far more methylated, with a varied tissue pattern, in normal cells (
Supplemental Fig. 5), consistent with studies of others(
12,
26) and verifying a long held biological premise. Finally, consistent with our previous studies (
18), gene promoter CpG island hypermethylation in cancer is starkly apparent (
Supplemental Fig 4). The number of unmethylated CpG island promoters falls to ~ 78% in 4 adult cancer cell lines and ~ 87% in two germ line teratocarcinoma lines representing 900 hypermethylated genes in the former and over 200 in the latter. Non-CpG island promoters have not been as carefully examined in cancer but the cancer cell lines still cluster separately from normal cells largely because of multiple genes that have gained methylation but also due to many that have lost normal methylation (
Supplemental Fig. 5). Importantly, for cancer cell lines such as HCT 116 colorectal cells, the Infinium analyses correctly identified (data not shown) ~90% of genes validated in our laboratory to have cancer specific DNA hypermethylation(
18).
With this above background, we examined not only our reprogrammed cell clones, but also pools of cells (
Supplementary Table 1 for listing) from early stages (days 6 to 18) following insertion of four factors into both our fibroblast and MSC parent cells and after introduction of OCT4 alone into fibroblasts. In these pools, cells are in a dynamic, meta-stable, state with a wide range of reprogramming stages (
10,
28) wherein most clones do not exhibit ESC morphology and many cells can be cultured indefinitely. Overall, cells in these pools maintain the normally unmethylated state of CpG island promoters, harboring many less abnormalities than seen in cancer cells (
Supplemental Fig 4). However, we observe that among normally unmethylated CpG island genes, 50 show abnormal methylation in one or more of the early cell pools by day 6, and 38 have abnormally added methylation in individual reprogrammed clones ( -individual clone genes in and pool genes,
Supplemental table 5). While the genes generally differ between the pools and clones, ~15% are shared between the two. Of 10 clones examined, including partially reprogrammed MP4 cells, only 2 (MMW1 and MMW2) failed to exhibit a hypermethylated gene while the remainder contained between 3 (MR46) and 17 (MB45). Significantly, over 60% of all these genes (p = 3.055e-07; Fisher’s exact test) in the clones and 50% (p = 7.302e-05; Fisher’s exact test) in the pools are also hypermethylated in one or more cancer cell lines (; individual genes, and
Suppl. Table 5). Finally, and very importantly, even pooled cells with introduction of OCT4 alone contained from 12 to 16 hypermethylated genes, and, again, ~ 50% of these were also hypermethylated in at least one cancer cell line (;
Suppl. Table 5).
| Table 1Genes with abnormally methylated CpG island promoters in clones, partially reprogrammed pools, or cancers. Dark squares = beta values above 0.45; light gray squares = values ranging to 0.45. |
The fully reprogrammed, iPS clones generally also behave very well with respect to non-CpG island genes. Thus, for all clones, 87 to 95% of such promoters properly either gain or lose DNA methylation properly relative to ESC (). Interestingly, and as might be expected, most of the genes in these groups do not make the changes required for iPS in the reprogramming pools of fibroblasts or MSC generally retaining the methylation patterns of the starting parent cells (). Again, despite the global proper behavior, multiple non-CpG island genes, relative to normal cells, both gain and lose promoter methylation. Thirteen genes abnormally gain methylation in the clones (
Suppl Table 6), and another 13 in the pools (
Suppl Table 7) while 56 and 37 lose methylation respectfully (
Suppl Tables 6 and 7). Particularly, MP4, contains 37 abnormal genes, 5 having abnormal gain and 32, abnormal losses, of methylation (;
Suppl Tables 6 and 7). A high percentage, 60% for the clones (p = 1.645e-12; Fisher’s exact test), and 43% for the pools, (p = 5.141e-08), of the abnormal gains and losses of DNA methylation also appear in one or more of the cancer cell lines (
Suppl Tables 6 and 7). Finally, for MP4, only 35% of non-CpG island genes properly gained or lost DNA methylation relative to ESC.
Importantly, many of the above abnormal CpG island genes are hypermethylated in primary human cancers and have roles in malignancy and/ or embryonic cell fate patterning.
GATA4 (Suppl Ref 6), central to proper endodermal differentiation, is hypermethylated in multiple cancer types,
O6-MGMT(Suppl Refs 7,8) is also hypermethylated in cancer and loss of function impedes DNA repair,
TCERG1L has been recently identified as a low frequency mutated gene in colon cancer (
Suppl Ref 9) and is DNA hypermethylated in virtually all such tumors (unpublished data), SPRY2 plays a role in development and differentiation through negatively regulating the extracellular-regulated kinase pathway and is hypermethylated in cancers with prognostic significance(
Suppl Ref 10), SOX 1 is DNA hypermethylated in cancer and also plays key roles in development (
Suppl Ref 11),
BMP4, is an important morphogen for mesenchymal development and is hypermethylated in breast cancer (
Suppl Ref 12), and HOXA9 is an important cell pattern control gene frequently hypermethylated in breast cancer (
Suppl Ref 13).
LXN, hypermethylated in tumorigenic clone MP4, contains a weak CpG island and is hypermethylated in several cancer types (
Suppl Refs, 2, 3).
For selected of these important genes, we verified Infinium results by performing the methylation PCR assay, MSP(
29), querying 4 to 6 CpG sites positioned near the Infinium probes yielding hypermethylation values. Even though Infinium probes query small numbers of CpG’s in the islands, and development of abnormal methylation may be quite partial over the short time of re-programming, 7 of 10 genes were methylated by MSP, and at one of two promoter sites queried,
TCERG1L is fully methylated in iPS clones MP2 and MR45 (
Suppl Fig. 6a, b).
Abnormally silenced genes, chromatin, and cellular re-programming
The small fraction of DAC responsive (
supplemental Fig. 3), silenced, genes in iPS and reprogramming pools that are DNA methylated in reprogrammed cells is surprising. While, this could represent failure to detect very partial DNA methylation in queried promoters, it may involve links between abnormal gene silencing, DNA methylation in cancer, and chromatin states of embryonic cells. We(
30), and others(
31,
32), have found high percentages of DNA hypermethylated genes in cancer are marked by polycomb group silencing proteins (PcG) in embryonic cells. Such genes may have an abnormal progression from PcG marking to promoter DNA hypermethylation in cancer (
27,
30). In ESC, such promoters are not methylated, but rather maintained in a low, poised expression state by PcG regulation (
33–
35) and “bivalent” chromatin constituted by transcriptional positive (H3K4me2 and me3), and negative (the PcG mark, H3K27me3) histone modifications (
36). Interestingly, abnormally low expression genes in partially reprogrammed cells from mice have bivalent chromatin (
28)and aggressive human cancers have increased expression of PcG genes(
37) to levels such as those found in our iPS cells (
Supplementary Table 2).
We, thus, queried the promoter occupancy of our abnormally silenced genes in iPS clones, reprogramming pools, and cancer cell lines (,
Suppl Tables 4 and 5) in databases of others for embryonic cells (
37–
40). While there is no significant enrichment at the promoters of these genes for occupancy by the iPS reprogramming factors themselves(
37) save, for Nanog at the promoters of the DNA hypermethylated genes in and
Supplemental Table 5, in the clones and partially reprogrammed pools, there is an enrichment for PcG promoter marking(). This is confirmed by local chromatin immunoprecipitation (ChIP) of the PcG mark, H3K27me3, for promoter regions of selected of the above genes in ESC, iPS clone MP2, and the partially reprogrammed clone, MP4 ().