Cytosine methylation - the formation of 5-methylcytosine (5mC) at CpG sites - is an important epigenetic modification used by mammals to mediate transcriptional regulation, including transcriptional repression, X-chromosome inactivation, imprinting, and suppression of parasitic sequences (
Bird, 1992;
Kass et al., 1997;
Siegfried and Cedar, 1997). The establishment and maintenance of the correct DNA methylation patterns at CpG sites is essential in mammals during development, gametogenesis and differentiation of somatic tissues. Indeed alterations in DNA methylation patterns, with the associated chromatin changes, have profound consequences, as demonstrated by embryonic lethality in the absence of DNA methylation (
Li et al., 1992;
Okano et al., 1999), developmental defects and accelerated aging in cloned mammals (
Rideout et al., 2001), and characteristic epigenetic changes in cancer, such as global genome hypomethylation and tumor suppressor gene hypermethylation (
Feinberg and Tycko, 2004;
Jones and Laird, 1999).
While DNA methylation is mediated by
de novo DNA methyltransferases (DNMT3a and DNMT3b) that act on unmethylated DNA and maintenance DNA methyltransferases (DNMT1) that act on newly replicated, transiently hemimethylated DNA, the demethylating activities or processes that remove methylation marks in mammals are largely unknown. Indeed, it has been controversial as to whether demethylation is an active process in mammals (
Ooi and Bestor, 2008) and which mechanisms are involved (
Wu and Zhang, 2010).
Demethylation can occur passively due to replication in the absence of re-methylation, with consequent dilution of this modification. However, there is evidence supporting the occurrence of active demethylation in mammals, including demethylation of the paternal genome shortly after fertilization (
Mayer et al., 2000;
Oswald et al., 2000), demethylation to erase and reset imprinting in primordial germ cells (
Reik et al., 2001;
Surani et al., 2007), demethylation during somatic differentiation of the developing embryo to establish tissue-specific gene expression patterns (
Kress et al., 2006;
Niehrs, 2009) and during gene activation in adult kidney (
Kim et al., 2009) and brain (
Ma et al., 2009). In addition, it is generally thought that active transcription contributes to the maintenance of the unmethylated state of promoter-associated CpG-rich sequences known as CpG islands, but the molecular details of protection from hypermethylation and the potential involvement of an active demethylation process are unknown (
Illingworth and Bird, 2009).
Accumulating evidence in non-mammalian model organisms point to the involvement of DNA repair mechanisms in active demethylation (
Gehring et al., 2009;
Niehrs, 2009). In Arabidopsis, the base excision repair (BER) proteins Demeter and ROS1 affect demethylation by directly removing 5mC through their glycosylase activities (
Gehring et al., 2006;
Morales-Ruiz et al., 2006). In Xenopus, demethylation has been reported to be initiated by the genome stability protein Gadd45a (growth arrest and DNA damage-inducible protein 45 alpha) in a process dependent on the nucleotide excision repair protein XPG (
Barreto et al., 2007); however the role of mammalian GADD45 in demethylation (
Barreto et al., 2007;
Schmitz et al., 2009) has been challenged (
Jin et al., 2008). In zebrafish embryos, rapid demethylation of exogenous and genomic DNA occurs in two coupled steps: enzymatic 5mC deamination to thymine by Activation Induced deaminase (AID) or Apolipoprotein B RNA-editing catalytic component 2b and 2a (Apobec2b, 2a), followed by removal of the mismatched thymine by the zebrafish thymine glycosylase MBD4, with Gadd45 promoting the reaction (
Rai et al., 2008). Recently, 5-hydroxymethylcytosine (5hmC), an oxidative product of 5mC generated by the Tet hydroxylases (
Kriaucionis and Heintz, 2009;
Tahiliani et al., 2009), has been proposed as a demethylation intermediate (
Globisch et al., 2010;
Wu and Zhang, 2010). During gene activation in the adult mouse brain, demethylation by TET1-mediated hydroxylation of 5meC to 5hmC was promoted by AID/Apobec deaminases, in a process that generates 5-hydroxymethyluracil (5hmU) and ultimately requires BER, although the specific glycosylases involved were not identified (
Guo et al., 2011).
Numerous in vitro studies have documented a potential role of the BER enzyme TDG (thymine DNA glycosylase) in transcriptional regulation and demethylation. Indeed, TDG interacts with several transcription factors, including retinoic acid receptor (RAR), retinoid X receptor (RXR) (
Um et al., 1998), estrogen receptor α (ERα) (
Chen et al., 2003), thyroid transcription factor 1 (TTF1) (
Missero et al., 2001) and histone acetyl-transferases p300 and CBP (
Tini et al., 2002). It has been proposed that TDG may be responsible for demethylation, either through a direct 5mC glycosylase activity (
Zhu et al., 2000), or indirectly, by acting on G:T mismatches originated by a controlled deaminase activity of DNMT3a and DNMT3b (
Metivier et al., 2008). Very recently, TDG was shown to be involved in maintaining active and bivalent chromatin marks in mouse embryo fibroblasts and ES cells undergoing neuronal differentiation, respectively, but the mechanism for such epigenetic effects and the requirement of its catalytic activity were not clarified (
Cortazar et al., 2011). To investigate the functional role of TDG in epigenetic regulation, DNA demethylation and mammalian development, we generated mice with targeted inactivation of the TDG gene.
Tdg-null embryos die in midgestation and exhibit a complex developmental phenotype that appears to derive from the failure to establish and maintain proper DNA methylation patterns at promoters and enhancers. A knock-in mutation that inactivates the glycosylase function of TDG is also embryonically lethal, and TDG is found in a complex with AID and GADD45a. These findings suggest a two-step catalytic mechanism for DNA demethylation that is essential for mammalian development.