Early human embryo development results in the reprogramming of highly specialized germinal cells into totipotent and then pluripotent cells that are the progenitors of all the specialized cell types of the human body. This unique biological property has been harnessed to restore pluripotency in human somatic cells by SCNT or cell-fusion using embryonic stem cells [
6,
7]. hESC share with pluripotent stem cells from the inner cell mass pluripotency transcription factors and multi-lineage differentiation properties, and are considered a good
in vitro model for pre-embryo pluripotent stem cells. Though human oocytes and ESC are developmentally separated by less than one week, the transcriptome of the oocyte undergoes rapid changes after fertilization [
24,
25]. We undertook to find out a common expression signature to these two cell types, that share somatic cell reprogramming properties, by comparing them to a large collection of somatic tissues samples. A first observation was that the oocytes/hESC signature was highly enriched in genes involved in cell cycle. Whereas this was expected because of the cell cycle status of these two cell types, the expression of a large set of genes associated with cell division is nevertheless of interest for cellular reprogramming. As recently reported, prior mitotic remodeling of the somatic nuclei, involving topoisomerase II (TOP2)-dependent shortening of chromatin loop domains and an increased recruitment of replication initiation factors onto chromatin, is essential for reprogramming of differentiated nuclei [
26]. Strikingly, we found that
TOP2A was highly up-regulated in both oocytes and hESC. This observation suggests that
TOP2A could be a major factor in the reprogramming properties of oocytes and hESC by participating in chromatin remodeling. Conversely, the identification of a "cell cycle signature", shared with highly proliferating tissues such as cancer cell lines, provided a mean to identify by subtraction a "non-cycle oocytes/hESC" signature of 432 PS. This signature included transcripts coding for proteins involved in chromatin structure modifications such as DNMT3B, JARID2, SMARCA5 or CBX5 that contribute to the DNA methylation and chromatin remodeling (Figure ). Consistent with these observations, hESC display a distinct, permissive, chromatin structure compared with other tissues [
27]. Expression of DNA methyl-transferases or several ATP-dependent chromatin remodelling factors are elevated in murine oocytes or ES cells [
28,
29]. Thus our findings show large similarities between murine and human ES cells, and put forward several genes whose strong overexpression could contribute to the specific chromatin state of hESC.
Another lesson from our transcriptomic approach is that the common oocytes/hESC gene expression profile has a very low number of genes that are either secreted or membrane bound (Figure ). This is in line with our previously published data that booth oocytes and hESC "specific genes" are significantly depleted in extracellular signalling components, suggesting that this feature is indeed a common characteristic shared by oocytes and hESC and is not simply due to a lack of overlap [
18,
20]. Hence, genes specifically shared by oocytes and hESC are largely nuclear proteins. One assumption that can be inferred from these findings is that determinant of pluripotency may be mostly intrinsic factors. This observation converges on a recent model, which proposed that pluripotency is a ground state that is intrinsically self-maintained when protected from extrinsic differentiation stimuli [
30].
An unexpected observation was that genes involved in protein ubiquitination and proteasome pathway were also overrepresented in the oocytes/hESC signature. This could be linked to the strong proliferation signature of hESC and oocytes as this pathway is by many way implicated in the regulating the cell cycle [
31]. However, the overexpression of the ubiquitination/proteasome pathway was still significant when the cell cycle signature was substracted, suggesting that this pathway could have a role in pluripotent cells in addition to its house keeping or cell cycle functions. In line with these results, we showed a selective sensitivity of hESC to the inhibition of the activity of the proteasome, resulting in loss of pluripotency and cell growth at doses without any detectable effects on differentiated but cycling cells such as primary fibroblasts or hESC derived fibroblast like cells. In addition, it must be stressed that the dramatic effects on hESC pluripotency were observed at doses of the proteasome inhibitor MG132 (0.5 μM) significantly lower than those typically found in the literature (several μM) or in mice ES cells (20 μM) [
23]. This observation is highly interesting in light of the recent findings of the role of the proteasome in transcription, especially in hESC. The 26S proteasome consists of a 20S core proteolytic part, capped by a 19S regulatory complex. Specificity of degradation of proteins is mediated in part by poly-ubiquitination of the substrate bound for destruction. Based on early work in yeast, the proteasome is known to interact with chromatin and function at multiple steps in transcription, both through proteolytic and non-proteolytic activities [
32]. Recently, Szutorisz et al. reported that the 26S proteasome is assembled on intergenic and intragenic regions in ES cells and act as a transcriptional silencer by blocking non-specific transcription initiation [
23]. This mechanism involves the proteolytic activity of the 20S core by degrading non-specific preinitiation complexes, thereby preventing permissive transcription and spreading of the modified chromatin. Our results are consistent with this hypothesis, but final answer on this issue will require further investigations.
This work has compared human MII oocytes and hESC to somatic tissues gene expression profiles. One goal was to provide new hints on the process of nucleus reprogramming which takes place in vivo during early embryo development or in vitro during SCNT, and may thus help to improve the iPS technology. Indeed, since the seminal work of the team of Shinya Yamanaka, numerous improvement have been made, including the identification of new genes able to replace some of the original ones in the reprogramming cocktail, the use of small molecules or the replacement of the retroviral vectors by adenoviruses or plasmids [
33-
37]. A first observation is that human mature oocytes do not express the pluripotency core transcriptional genes
POU5F1/OCT4,
NANOG and
SOX2 [
13], except
POU5F1/OCT4 at low level (see Figure and our Amazonia! on-line expression atlas,
http://amazonia.montp.inserm.fr). They neither express
KLF4 nor
CMYC, which compose, with
POU5F1/OCT4 and
SOX2, the four factors that can reprogram somatic cells by virus-mediated overexpression [
11]. From the six "reprogramming" factors described to date, only LIN28 was found in the oocyte/hESC signature. However, POU5F1/OCT4, NANOG, SOX2, KLF4, LIN28 and CMYC are all expressed by hESC. Therefore, during early embryo development, the expression of these genes is induced. Thus, two different molecular pathways that can reprogram adult somatic cells can be envisioned: (i) the process taking place in the oocyte cytoplasm, able to activate the core transcriptional genes, or (ii) the overexpression of the core transcriptional genes themselves together with adjuvant genes, either by viral overexpression or by fusion with cells already expressing these genes. It can be speculated that the factors that lie upstream of the pluripotency core transcriptional circuitry are expressed as mRNA in mature MII oocytes and are still present at blastocyst stage from which hESC are derived. Thus, the oocytes/hESC signature likely includes these factors, and therefore this information could be highly informative for cell reprogramming. The signature contained numerous transcritption factors, including many zinc finger such as
ZNF84, several still poorly annotated genes such as
KLHL7,
MRS2, or the Selenophosphate synthetase 1 (
SEPHS1), displayed a strong cell cycle signature, chromatin modification genes, and also many actors of the proteasome pathway. All these genes are candidate genes to improve the efficiency of iPS generation, especially in the light of the recent advances that uses non retroviral vectors but at the cost of lower efficacy.