The basic genetic information that orchestrates development and differentiation of living organisms is encoded within the DNA. How such information is interpreted during the proper stage of development depends mainly on the interaction of DNA with proteins called histones. Histones package DNA into nucleosomes, which is major part of the packaging and organization of DNA within the nucleus (
Kornberg, 1974,
2007;
Luger et al., 1997). The N-terminal tails of histones protrude outward from the nucleosomes and are subject to various posttranslational modifications regulating the accessibility of DNA to factors involved in many DNA-mediated phenomena such as DNA replication, repair/recombination and transcription (
Luger et al., 1997). To date, acetylation, phosphorylation, methylation, ubiquitination, sumoylation, ADP-ribosylation, deimination and proline isomerization are characterized as histone posttranslational modifications (
Nowak and Corces, 2004;
Shilatifard, 2006;
Zhang, 2003). Whereas histone lysine acetylation and serine/threonine phosphorylation generally appear to regulate gene expression in a positive way (
Berger, 2002,
2007), the impact of histone lysine methylation on transcription often is dependent on the site of the modification and its biological context (
Shilatifard, 2006;
Zhang, 2003). Furthermore, the extent of the methylation of lysine (mono-, di-, and tri-) adds further to the complexity of the relationships between the lysine methylations and those biological outputs (
Shilatifard, 2008).
The subunits COMPASS, which are named Cps60 through Cps15 according to their apparent molecular weight, are all required for proper H3K4 methylation by the complex (
Krogan et al., 2002;
Schneider et al., 2005;
Shilatifard, 2006). For example, several subunits such as Set1, Cps50 and Cps30 are necessary for the mono-, di- and trimethylation of histone H3K4, whereas Cps40 and Cps25 are necessary just for the trimethylation of histone H3K4 (
Schneider et al., 2005). Following the purification of COMPASS from yeast, it was demonstrated that the mammalian MLLs also exist in COMPASS-like complexes capable of methylating the lysine 4 of histone H3 (
Hughes et al., 2004). We now know that there are at least seven COMPASS and COMPASS-like complexes in mammalian cells all capable of methylating H3K4 (
Shilatifard, 2008). Indeed many of the subunits shared from yeast to human, also function similarly in both organisms (
Shilatifard, 2008).
Our Global Proteomic Analysis in Yeast (GPS) studies in yeast also demonstrated that histone H2B monoubiquitination at lysine 123 residue (K123) is required for proper methylation by COMPASS (
Dover et al., 2002). We have shown that histone H2B monoubiquitination is detected at both the promoters and coding regions of the gene. The enzymatic machinery implementing H2B monoubiquitination was originally identified in yeast (
Dover et al., 2002;
Wood et al., 2003). Yeast Rad6, is an E2 ubiquitin-conjugating enzyme (
Robzyk et al., 2000), and yeast Bre1, its E3 ubiquitin ligase, directly interacts with Rad6 to form a stable complex and monoubiquitinate H2BK123 (
Wood et al., 2003). Similar to COMPASS, Rad6 and Bre1 are found in both yeast and mammals (
Kim et al., 2009;
Kim et al., 2005;
Pavri et al., 2006;
Zhu et al., 2005). Following the studies in yeast demonstrating a role for the Rad6/Bre1 complex in the regulation of COMPASS’s function, it was demonstrated that human hRad6A and hRad6B and hBre1A and hBre1B can monoubiquitinate histone H2B and function in H3K4 methylation in vivo (
Shilatifard, 2008). In this manuscript, we will review our structural and molecular understanding of factors and machineries involved in the regulation of COMPASS’s H3K4 trimethylase activity.
Components of COMPASS involved in the regulation of H2B monoubiquitination H3K4 methylation crosstalk
Since composition and catalytic activity of COMPASS is conserved from yeast to human, yeast has provided an outstanding model for a molecular understanding of H3K4 methylation by COMPASS and COMPASS-like complexes in mammals. Our initial studies in yeast demonstrated that COMPASS purified from strains lacking H2B monoubiquitination to have several fold less of the Cps35 subunit associated within COMPASS (
Lee et al., 2007). Furthermore, COMPASS purified from an H2B monoubiquitination deficient background lacks the enzymatic ability to trimethylate histone H3 (
Lee et al., 2007). These observations pointed towards Cps35 as the subunits of COMPASS involved in regulating the crosstalk between H2B monoubiquitination and H3K4 trimethylation. In support of this observation, Cps35 interacts with chromatin in a monoubqiutination-dependent manner (
Lee et al., 2007;
Takahashi et al., 2009). Furthermore, the Cps35 homologue in humans, the Wdr82 protein, is a component of the human Set1 complex (hCOMPASS) (
Lee and Skalnik, 2008;
Wu et al., 2008) and requires H2B monoubquitination for its interaction with chromatin (
Wu et al., 2008).
Molecular mechanism installed within COMPASS for the regulation of H3K4 di- and trimethylation
As discussed above, the highly intricate network is found to regulate H3K4 methylation outside COMPASS. Recent studies start to clarify some novel aspects of differential regulation functioning within COMPASS to achieve the proper H3K4 methylation states. Ahead of the latest studies, Schneider et al., set out a comparative GPS by using all of the three specific antibodies towards the mono-, di-, and trimethylation of H3K4, and in combination with the biochemical analysis, determined each function of Cps40 and Cps60 (presumably through the Cps60-Cps25 subcomplex) in transitioning from di- to trimethylation and from mono- through trimethylation, respectively (
Schneider et al., 2005). The roles of the WD repeat proteins, Cps50 and Cps30, in the regulation of H3K4 methylation have remained elusive because of a lack of instability of Set1 within Cps50 or Cps30-deficient COMPASS (
Schneider et al., 2005; Dehe et al., 2006). Following studies (
Dou et al., 2006;
Patel et al., 2008a;
Steward et al., 2006) using a defined
in vitro system of recombinant proteins of MLL1 C-terminal SET domain-containing fragments and other core components (WDR5 (hCps30), RbBP5 (hCps50), and ASH2L (hCps60), set out to investigate these roles in reconstitution assays. We confirmed the specific Cps60-mediated regulation for H3K4 di- and trimethylation (
Steward et al., 2006). However, H3K4 trimethylation was not observed for the core MLL1 complex reconstituted with recombinant proteins prepared by using E. coli as a host strain (
Patel et al., 2008a).
In addition to the requirement of the subunits of COMPASS in regulating the pattern of mono-, di-, and trimethylation, several key residues within Set1’s catalytic pocket are also capable of regulating the trimethylation activity of COMPASS (
Takahashi et al., 2009). One such residue is the Tyrosine 1052 (Y1052) of Set1. Our studies have demonstrated that Y1052 plays an essential role in the regulation of the available space within the catalytic pocket of Set1, and thereby regulating the transitioning from mono-to di- to trimethylation (). This residue Y1052 is of great interest to us due to the fact that it corresponds to a site known as the F/Y switch governing product specificities for several of the known histone methyltransferases (-). SET domain monomethyltransferases such as SET8 and SET7/9 bear a Y in this position, and the SET domain di- and trimethyltransferases, such as G9A and DIM-5, harbor phenylalanine or other hydrophobic residues in this site. COMPASS is an exception to this model as the SET domain of Set1 possesses a tyrosine in the F/Y switch position, but the enzyme is capable of catalyzing the mono-, di-, and trimethylation of H3K4.
We have proposed that a subunit of COMPASS, Cps40, which is required for proper H3K4 trimethylation by the complex, is the subunit responsible for this exception to the F/Y switch model for COMPASS. We put into action a combinatorial analysis of COMPASS in both the presence and absence of several of its subunits including
cps40 and
cps60, and site-directed mutagenesis focusing on the putative F/Y switch of the active site, which governs the product specificities (
Takahashi et al., 2009). We proved that the F/Y switch model is certainly operative in Set1 of COMPASS by substituting the Tyr1052 with Phe (Y1052F), with the specific enhancement of the H3K4 trimethylation activity exhibited
in vivo and
in vitro (
Takahashi et al., 2009). More importantly, the enhancement effects suppressed the H3K4 trimethylation defect of Cps40-deficient COMPASS and even reached both the H3K4 di- and trimethylation defects of COMPASS lacking Cps60, as well. Those results suggested not only that the F/Y switch rule dominates all of the SET-domain proteins including the SET1 family, but also that the Tyr1052 residue of Set1 functions together with Cps40 and Cps60 to implement the H3K4 di- and trimethylations properly (
Takahashi et al., 2009) (). The SET1 family of proteins may have evolved to obtain the intrinsically down-regulated catalytic domain to adapt to the necessity of the differential and diverse levels of H3K4 methylation regulation through the integration of the signals of their various unique and common subunits.
Apart from the SET domain, several regulatory domains or motifs within the Set1 polypeptide were identified through genetic and biochemical analyses. Schlichter and Cairns characterized and suggested that the RRM (RRM1) and autoinhibitory domains N-terminal to the SET domain to be positive and negative modulators toward H3K4 di- and trimethylation, respectively (
Schlichter and Cairns, 2005). The following study found another RRM domain (RRM2) adjacently to the C-terminal to RRM1 and the destabilizing effect on Set1 of the RRM1 mutation (Tresaugues et al., 2006), implying that the inactivation of H3K4 methylation of the RRM1 mutant Set1 may be due to the loss of COMPASS’s integrity, not the catalytic effects. It is of note that the amino acid sequences from the N-terminal to the SET domain are relatively divergent than the above mentioned RRM1, RRM2, and that the auto-inhibitory domains are only conserved among
S. cerevisiae Set1, though not among human Set1A and Set1B, and the MLL1-4 proteins. Prominently, in recent studies, the
WDR5-
interaction motif, Win motif, which is strictly conserved among human Set1A, Set1B, and the MLL1-4 proteins, was identified immediately upstream of the SET domain of MLL1 together with the co-crystals of WDR5 and a Win peptide from MLL1 (
Patel et al., 2008a;
Patel et al., 2008b;
Song and Kingston, 2008). Their strong affinities with the dissociation constant of sub-micromolars between the recombinant core MLL1 fragment and WDR5 in a stoichiometric ratio can be attributed solely to the Win motif, suggesting that the Win motif is the critical determinant for the binding to WDR5 of the human Set1 and MLL proteins within hCOMPASS and hCOMPASS-like complexes. For unknown reasons, the Win motif is not obviously conserved in
S. cerevisiae Set1 (data not shown). However, the C-terminally truncated Set1 that includes the N-SET region could be stably purified with Cps30 (WDR5 in humans) and Cps50. Cps30 may still bind to Set1 in a similar way to WDR5 and MLL1, but through different residues within the N-SET region.
Recently, Vitaliano-Prunier et al. have proposed a Cps35-mediated activation of COMPASS, however, by a different mechanism than that which we have proposed (
Vitaliano-Prunier et al., 2008). Vitaliano-Prunier have suggested that Cps40 of COMPASS accomplishes H3K4 trimethylation crosstalk based on the observations that Cps35 was subject to ubiquitination by the Rad6/Bre1 complex when both Cps35 and ubiquitin are overexpressed in yeast (
Vitaliano-Prunier et al., 2008). Vitaliano-Prunier and colleagues also have proposed that such ubiquitination of Cps35 is involved in the recruitment and association of Cps40 with COMPASS and chromatin. The Cps35 ubiquitination remains quite controversial because the observed Cps35 ubiquitination was under overexpression conditions for both Cps35 and ubiquitin, and the
in vitro reconstituted analysis using the purified components were not performed (
Vitaliano-Prunier et al., 2008). Indeed, Vitaliano-Prunier and colleagues are not able to detect even a 1% ubiquitination of Cps35 when they overexpress both Cps35 and ubiquitin. Furthermore, our analysis of the H2B monoubiquitination deficient cells such as
rad6Δ and the H2BK123R strains clearly demonstrates a stable Cps40 association on chromatin in an H2B monoubiquitination-independent manner as well as its stable assembly within COMPASS (
Takahashi et al., 2009). Furthermore, when COMPASS is purified from strains lacking Cps35, the same levels of Cps40 are purified with COMPASS suggesting that Cps35 has very little to do in recruiting Cps40 to COMPASS as suggested by Vitaliano-Prunier and colleagues (
Takahashi et al., 2009). Overall, our studies suggest that H2B monoubiquitination inducing the Cps35 assembly into and activating COMPASS is a distinct mechanism from the Cps40-mediated adjustments of the H3K4 trimethylation activity in conjunction with the Y1052 residue in the Set1 catalytic site.
H2B monoubiquitination is not the only crosstalk for H3K4 methylation by COMPASS
In addition to H2BK123 monoubiquitination, the surfaces of the nucleosome can play important roles for the implementation of H3K4 methylation. The
scanning
histone
mutagenesis with
alanine (SHIMA) library analysis elegantly showed and elucidated a novel patch on the histone H2A/H2B dimer and a lysine residue of the histone H3 N-terminal tail (H3K14) as trans- and cis-regulatory regions for H3K4 methylation, working possibly through the effects on the H2B ubiquitination machineries and/or the direct interactions with COMPASS (
Nakanishi et al., 2008). Interestingly, it was recently revealed that other histone lysine methylations such as H3K36 and H3K79 also require the respective histone trans-regulatory regions and corresponding interactions with their methyltransferase enzymes such as H4K44 and the N-terminal acidic patch of Set2, and the H4R17-H18-R19 basic patch and the C-terminal acidic patch of Dot1. Compared with these two trans-regulatory pathways, which are obviously based on the direct charge-based electrostatic interactions between each enzyme and histone protein in the nucleosome, the COMPASS-associated histone regulatory regions are relatively complicated, because the H2A/H2B patch consists of a mixture of both acidic (H2AE65-N69-D73) and basic (H2BH112-R119) residues, the latter of which is proximal to the monoubiquitination site (K123) of H2B, and furthermore, some of those residues affect H2B monoubiqutination positively and others do so negatively. Nevertheless with similar H3K4 di-and/or trimethylation defects observed when measuring bulk histones from the different mutants (
Nakanishi et al., 2008), further detailed genetic and biochemical analyses are required for an exact understanding of those these regulation mechanisms. However, histone regulatory pathways requiring these novel crosstalks among histone methyltransferases and nucleosome histone surfaces are certainly one of the emerging concepts in the studies of the epigenetic histone posttranslational modification network.