The Xi is distinguishable from its active counterpart by its epigenetic marks, including coating with
Xist RNA. This is the earliest event in X inactivation during embryogenesis, and gene silencing follows within one or two cell cycles [
7]. Interestingly,
Xist-induced silencing can only be achieved in early differentiating ES cells, and reaches a point of irreversibility. Just how
Xist RNA is spread along the Xi is still not fully understood. One hypothesis suggests that long interspersed repetitive elements (L1) repeats are overrepresented on the X and may serve as 'booster' elements by anchoring
Xist RNA to the chromosome, thus aiding spreading [
34]. Consistent with this hypothesis, human genes that escape X inactivation have fewer L1 repeats [
6,
35,
36]. These genes are also enriched in specific sequence motifs such as Alu repeats and short motifs containing ACG/CGT at their 5' ends [
37]. In the mouse, another type of repeat - long terminal repeats (LTRs) - appears to be depleted on escape genes [
19]. These observations imply that
Xist RNA coating could be deficient at genes escaping X inactivation. This was recently demonstrated in mouse myoblasts using RNA tagging and recovery of associated DNA (modified TRAP) method for identification of targets [
38]. In this study, escapees
Kdm5c and
Kdm6a, which encode chromatin-modifying histone lysine demethylases, were shown to be devoid of
Xist RNA coating over their promoters and transcribed regions. Conversely, genes subjected to X inactivation, and L1 repeat elements themselves, recruited
Xist RNA [
38] (Figure ). Taken together, these studies support the idea that specific DNA sequence motifs are involved in recruitment of
Xist RNA to the Xi.
While
Xist RNA coating is important in the initiation of X inactivation, many other epigenetic modifications follow to silence the X and maintain silencing. An early repressive chromatin mark, tri-methylation of lysine 27 on histone H3 (H3K27me3), is recruited by the Polycomb complex of chromatin-modifying proteins, resulting in compaction of the silenced portion of the Xi (Figure ). Other repressive marks include H3K9me3 and the histone variant macroH2A1, which are also enriched on the Xi (Figure ) [
7,
39]. Concomitantly, 'active' marks such as acetylation of histone H3 and H4 are lost from the silenced chromatin [
7,
40]. Modifications characteristic of silenced genes contrast with those within escape genes, which remain euchromatic and harbor histone H3 and H4 acetylation [
7,
41]. H3K4me3, another mark associated with transcriptional activity, is absent from most of the Xi except at discrete regions corresponding to areas of escape, as shown in female lymphoblasts [
42] (Figure ). We recently demonstrated a lack of H3K27me3 at escape genes in mouse, which shows complete concordance in the cell line used to assay allelic expression [
13].
The existence of discrete areas of 'escape chromatin' adjacent to silenced chromatin suggests the need for boundary elements, such as insulator sequences, that may block the spreading of heterochromatin into escape regions or prevent repressive marks from being added to escape domains (Figure ). Supporting this idea are our findings that the insulator protein CTCF (CCCTC-binding factor), which binds known insulator sequences, binds to the transition region between the escape gene
Kdm5c and the inactivated gene
Iqsec2 (IQ motif and SEC7 domain-containing protein 2) in mouse, whereas in humans, the corresponding region between the same genes, which both escape X inactivation, does not bind CTCF [
43]. Furthermore, we have found that the CpG island at the 5' end of
Kdm5c remains hypomethylated throughout mouse development, possibly because it is rendered inaccessible to DNA methyltransferases by CTCF binding (Figure ). CTCF-binding sites were also identified in other transition areas between escape and inactivated genes, suggesting that CTCF may play a role in the insulation of escape domains [
43]. However, a subsequent study showed that insertion of CTCF-binding sites from the
HS4 insulator site (from the chicken β-globin gene cluster) at each end of a short reporter gene was not sufficient to protect it from silencing when inserted within an inactivated gene on the Xi in mouse cells [
44]. A more recent study reported that a bacterial artificial chromosome clone containing
Kdm5c and its flanking regions retains its properties of escape even when inserted at other sites that are normally inactivated on the Xi in mouse cells [
45]. CTCF-binding sites may turn out not to be sufficient for insulation, and other elements within or around escape genes may be important.
In particular, the structure of chromatin may have an important role in insulation by looping specific regions out of the condensed Xi (Figure ) [
46]. Our recent X-chromatin profiles show a discontinuous distribution of the repressive chromatin mark H3K27me3 along the Xi, consistent with the presence of insulator elements and/or specific attachment sites for looped chromatin [
13]. However, in human × mouse hybrid cell lines, where the human X can be distinguished from the rodent background, repressive chromatin marks were found to be progressively diminished in the intergenic region between the inactivated
RBM10 (RNA-binding motif protein 10) and the escape gene
UBA1/UBE1 (ubiquitin-like modifier activating enzyme). Specifically, H3K9me3 and another histone modification associated with gene silencing, H4K20me3, were enriched in the last
RBM10 exon but were already depleted approximately 2 kb upstream of
UBA1/UBE1 [
41].
Escape from X inactivation can vary between different tissues and/or individuals and the escape status can also be developmentally regulated. In humans, about 10% of X-linked genes show variation in escape in different tissues and/or individuals [
12,
47]. Some escape genes may have a different chromatin structure throughout development, as suggested by the lack of promoter-restricted H3K4me2 in undifferentiated ES cells before X inactivation [
48]. Other escape genes may be initially silenced, and only reactivate in some tissues or with aging [
33]. Individual cells may also vary: in an analysis of single-cell allelic expression of
Kdm5c in mouse, significant silencing in individual embryonic cells was observed in contrast to consistent expression from both alleles in adult cells [
49]. Differences in H3K27me3 enrichment on some genes in a tissue and developmental-stage-specific manner also suggest variability in escape [
13]. For example, enrichment in H3K27me3 along
Mid1 (midline 1) in mouse embryos but not in adult liver suggests removal of the repressive mark in a tissue-specific manner. It is possible that the recently identified histone demethylases KDM6A and KDM6B may facilitate the removal of H3K27me3 at escape genes [
50-
52].