In eukaryotic nuclei, DNA associates with proteins to generate the heritable genetic information contained within chromatin. Transcriptionally active regions of the genome reside in euchromatin, whereas inactive regions reside in heterochromatin. Regulation of euchromatic and heterochromatic regions forms the basis of epigenetic gene expression control (
1) and proper chromosome condensation during meiosis (
2,
3). Alterations in chromatin structure have a vital role in changing the gene expression. Many alterations are targeted by specific chromatin remodeling complexes (
4) and/or by DNA modification enzymes (
5) directed at nucleosomes, the fundamental structural unit of chromatin. Nucleosomes are composed of 146 bp of DNA wrapped around an octamer of core histone proteins H2A, H2B, H3 and H4 (
6) that are often modified at specific amino acid residues (
7). These post-translational modifications include acetylation, phosphorylation, poly-ADP-ribosylation, ubiquitination and methylation (
8). Such modifications have major roles in regulating gene transcription, heterochromatin formation, X chromosome inactivation, DNA replication, DNA repair and cellular memory (
8–
13).
Histones are small basic proteins with globular core domains and flexible, charged N-terminus regions that protrude out of the nucleosome core leaving them susceptible to post-translational modifications. In histone H3, five lysine residues (K4, K9, K27, K36 and K79) and K20 in histone H4 may become methylated. Of the different site-specific H3 methylating sites, K9 methylation is well known for its role in heterochromatin formation, transcriptional silencing, X chromosome inactivation and DNA methylation (
14). Several enzymes for K9 methylation have been purified and studied. The first was Suv39h1, cloned and expressed by Jenuwein and co-workers (
15). Another close relative, Suv39h2, was also cloned and characterized (
16). Both Suv39h1 and h2 contain SET [(Su(var), Enhancer of zeste, Trithorax] domains and their catalytic activities are dependent on this domain. The SET domain has a unique structure (
17), rich in β-sheets, with each small sheet containing only a few strands. Although these K9 methyltransferases use AdoMet as a cofactor, no similarities with the canonical α/β AdoMet-dependent methyltransferases have been described previously (
18). Furthermore, both Suv39h1 and h2 contain a chromo domain that is considered a chromatin regulatory motif. The SUV39H family of proteins contributes to the organization of repressive chromatin regions, such as the centromeres (
19), and is co-localized with heterochromatin-binding protein 1 (HP1) in heterochromatic regions of the mammalian genome (
20).
Apart from the SUV39H family enzymes, three additional mammalian H3-K9 methyltransferases—G9a, ESET/SETDB1 and Eu-HMTase 1—have been reported previously (
21–
23). Unlike the SUV39H family, these proteins participate in H3-K9 methylation in euchromatic regions of the genome (
24). Along with enzymatic catalysis, Eu-HMTase 1 also associates with E2F-6, Mga and Max proteins and may be involved in the silencing of E2F and Myc responsive genes in quiescent cells (
23). The fact that multiple histone methyltransferases are capable of the same target lysine methylation raises the issue of redundancy in their functions. Targeted inactivation of G9a resulted in growth retardation and early embryonic lethality, suggesting that G9a-mediated H3-K9 methylation in euchromatic regions is essential for development (
25) and cannot be compensated for by the other three known K9 methyltransferases. It is possible that different histone methyltransferases methylate K9 at different times during the cell-cycle or development. Another H3-K9 methyltransferase, ESET, participates in early mammalian development (
26). Xin
et al. (
27) also demonstrated that H3-K9 methylation of the Prader-Willi syndrome imprinting center is reduced in G9a-null mouse ES cells, and maintenance of CpG methylation in the Prader-Willi syndrome imprinting center requires G9a function, bringing two global epigenetic modifiers, DNA and histone methylation, together. Furthermore, G9a is a transcriptional repressor of PRDI-BF1, a DNA-binding protein involved in post-induction repression of interferon-β gene transcription in response to viral infections (
28).
G9a is expressed ubiquitously and was originally mapped to the class III region of the major histocompatibility complex locus (
29). Mouse G9a is 1263 amino acids long. The amino half of G9a has very little similarity with any conserved protein domain. However, the carboxy half has homology with ANK (ankyrin) repeats, and contains preSET and SET domains. The SET domains are conserved in chromatin-modifying enzymes (
22) that include histone methyltransferases (
30), which recognize histone tails and modify specific lysine residues (
31). A second distinguishing feature of G9a is the presence of several ANK repeats implicated in protein–protein interactions in very diverse protein families. G9a is also homologous to human SUV39H1 and yeast Clr4 family proteins.
Although G9a is crucial for transcriptional silencing and development, very little is known about its functional domains and its association with chromatin. Furthermore, there is emerging evidence for chromatin modifying enzymes, such as histone methyltransferases, in establishing and maintaining gene silencing either alone (
32) or in cooperation with DNA methyltransferases (
33). Here, we have evaluated the functional domains of mouse G9a for nuclear localization features and the minimal elements required for catalysis. The functional significance of conserved C-terminus amino acids for enzymatic activity was also determined.