MeCP2 belongs to a family of conserved vertebrate proteins that bind to symmetrically methylated CpG dinucleotides and, at least in some cases, transmit epigenetic signals encoded in DNA methylation (reviewed in
1,
2). The finding that mutations in human MeCP2 result in Rett syndrome (RTT), a debilitating neurodevelopmental disorder (
3), and that its mis-regulation is common in other patients with Autism Spectrum Disorders (
4,
5), stimulated a focused effort to determine its function(s) and mechanism(s) of action. It is now clear from work with both humans and mouse models which recapitulate many of the human symptoms, that MeCP2 is required for the development and maintenance of neurons in some regions of the brain (
5). However, it has been difficult to determine the molecular event(s) that are affected by MeCP2 deficiency or by RTT-causing mutations which lead to functional deficit(s). A complicating issue is that the function of MeCP2 appears to be largely context dependent, varying with species, tissue and cell type, and presence of binding partners. Xenopus MeCP2, for example, is important in neuronal fate decisions in early embryogenesis, a function not seen in mice (
6). In humans, MeCP2 deficits have been linked to conditions other than RTT, including cancer (
7–
10).
Protease-resistance and modeling studies have revealed that MeCP2 is a striking example of an intrinsically unstructured protein containing protease resistant domains having varying degrees of disorder (
11). Early work identified a short (~90 residue) ordered region of MeCP2 (between residues 75–164) with the ability to bind methylated DNA (
12). This region, named the methylated DNA binding domain (MBD), is highly conserved (only four amino acid differences between Xenopus and humans). Its structure has been determined by both NMR and x-ray diffraction (
13–
15). DNA methylation is typically associated with transcriptional repression, and indeed MeCP2 has been shown to repress methylated genes in artificial systems
in vitro (
16). This transcriptional down-regulation activity was mapped to amino acids 207–310 and the region was accordingly termed the transcriptional repression domain (TRD) (
16). The MBD and TRD together comprise ~40% of the 486 residue hMeCP2 sequence (, upper diagram).
One early proposed mechanism of methylation-dependent repression involves an initial binding of MeCP2 to methylated DNA via the MBD, followed by TRD-mediated recruitment of the Sin3A co-repressor and histone deacetylase complexes (HDACs). The subsequent deacetylation of histones in nucleosomes would render the local chromatin region more refractory to transcription (
17). However, it is now clear that the mechanism of repression is often more complex, involving multiple MeCP2 binding partners. For example, in non-neuronal Rat-1 cells, repression of the neuron-specific
NaCh type II gene by MeCP2 appears to involve at least three direct MeCP2 binding partners (methylated DNA, CoREST, and the histone H3 lysine methyl transferase SUV39H1) as well as indirect contributions from REST/NRSF and HP1 (
18). In addition, the TRD of MeCP2 has been shown to be an important recruitment platform for several transcriptional modulators and epigenetic regulators in addition to mSin3A and HDACs. These include Ski, N-COR (
19), DNMT1 (
20), histone H3K9 methyltransferase (
21), PU1 (
22), splicing factors (
23), BRM (
24), RNA (
25), and the RNA splicing machinery (
26). The identification of numerous complexes that interact with MeCP2 suggests that additional modes of MeCP2 function remain to be discovered and call further attention to its identity as an intrinsically unstructured protein (
11), which characteristically has large number of binding partners and multiple functions (
27).
Further insight into the complexity of MeCP2 biology has come from recent genome-level studies. These revealed that MeCP2 binding is not confined to chromatin containing methylated DNA (
28), and that MeCP2 binding can lead to both repression and up-regulation depending on the gene context (
29). Evidence has also been presented that MeCP2 is involved in the maintenance of large-scale chromatin loops, perhaps by physically anchoring loop bases (
30). This suggestion is consistent with the ability of MeCP2 to promote nucleosome-nucleosome interactions in vitro, a property that is enhanced by, but not dependent on DNA methylation (
31–
33). These findings establish that MeCP2 is a multifunctional protein, and suggest that the different functions are highly context dependent.
With the exception of the MBD, very little is known about the structural properties of MeCP2 and how they contribute to the functional complexity of the intact protein. Studies of RTT-causing MeCP2 mutations show that the most prominent are a few missense mutations in the MBD that disrupt its structure and affect folding (
13,
34). There are, however, RTT-causing mutations throughout the entire molecule (see Rett syndrome database at
http://mecp2.chw.edu.au/mecp2/), indicating that regions of MeCP2 other than the MBD and TRD contribute to its multiple functions. Indeed, several reports have associated specific functions with individual regions. For example, the N-terminal domain of MeCP2 has been shown to mediate interactions with HP1 needed for transcriptional silencing during myogenic differentiation (
35). The short (~45 residue) domain connecting the MBD and TRD has been recently shown to be instrumental in stable MeCP2 binding to chromatin in vivo (
36), and a study of RTT patients showed that mutations tend to be located in this region (
37). Finally, the C- terminal portion of MeCP2 required for chromatin interactions
in vitro (
31) also harbors the Group II WW domain binding motif required for binding to splicing factors (
23), and the SPxK DNA-binding motif found in histone H1. The importance of the C-terminal region for MeCP2 function is underscored by the frequent occurrence of C-terminal deletions in RTT patients (
37).
In order to better understand the interactions and functions of the different domains of human MeCP2, we have undertaken a systematic study of their properties, focusing on their structure, their interactions, and their DNA and chromatin binding abilities. Our work shows that the different domains are highly diverse in many respects, revealing novel properties and providing new mechanistic insights regarding the overall structure of the protein. The MBD and TRD, with their ability to bind methylated DNA and unmethylated DNA, respectively, are clearly key functional elements. Here we show that the N-terminal domain flanking the MBD modulates the affinity of MBD-DNA binding. Further, the intervening domain (ID) between the MBD and TRD possesses a strong, autonomous methylation-independent DNA-binding activity and also facilitates MBD dependent binding. We also report that some domains show a dramatic acquisition of secondary structure upon DNA binding and, while there are 4 autonomous DNA binding domains in MeCP2, considerable synergism exists in their mode of binding. Further, when bound to DNA, some domains increase the stability of MeCP2. Specific inter- domain interactions are seen both
in cis, and
in trans, suggesting that these physical couplings play an important role in MeCP2 structural organization and function. We have also expanded our understanding of MeCP2 as an intrinsically unstructured protein, and show that it has an unusually large number of interspersed
Molecular
Recognition
Features (MoRFs) (
38,
39), short regions predicted to acquire structure when complexed with binding partners. The occurrence of several RTT-causing mutations within MoRFs further underscores their importance in MeCP2 function. Taken together, these studies significantly advance our understanding of the molecular basis of the unusual structure of MeCP2, and its relationships to DNA binding and the modulation of chromatin conformation.