H3R2me2s is present at recombinationally active antigen receptor loci
To determine whether H3R2 is symmetrically dimethylated in mammalian cells and to explore the relationship between H3K4me3 and H3R2me2s, we generated two affinity-purified antibodies. The specificity of each affinity-purified antiserum was validated by peptide dot blot analysis (Figure S1A
). The first antibody, α-pan-H3R2me2s, showed >25 fold preference toward H3R2me2s over H3R2me2a and ~5 fold preference for H3R2me2s over H3R2me2sK4me3 (Figure S1A
, top left panel). The second antibody, α-H3R2me2sK4me3, recognized only the H3R2me2sK4me3 peptide and not either modification alone (Figure S1A
, bottom left panel).
Both antibodies robustly recognized histone H3 in Western blot analysis of nuclear extracts derived from a lymphoid cell line poised to carry out V(D)J recombination between the IgH D and J segments (Figure S1B
). Peptide competition Western blots of the pro-B cell nuclear extracts confirmed that the histone H3 signal was due to bona fide recognition of H3R2me2s and/or H3R2me2sK4me3 (Figure S1C
). Chromatin immunoprecipitation followed by qPCR (ChIP-qPCR) revealed that H3R2me2s, H3K4me3, and H3R2me2sK4me3 are all enriched at actively rearranging gene segments in developing lymphoid cells (). Thus, H3R2me2s is a novel histone modification present in developing lymphoid cells. Moreover, because the H3R2me2sK4me3 antibody has the unusual property of requiring the simultaneous recognition of two histone modifications, H3R2me2s and H3K4me3 must reside on the same histone tail, at least on some histones, providing an opportunity for RAG2 to simultaneously bind to both methylated residues.
Figure 1 H3R2me2s colocalizes with H3K4me3 at the IgH locus in Rag2−/− pro-B cells. Chromatin from Rag2−/− Abelson-transformed pro-B cells was immunoprecipitated with the α-H3K4me3 (upper panel), α-H3R2me2sK4me3 (more ...)
H3R2me2s colocalizes with H3K4me3 throughout the mouse genome
The striking similarity between the patterns of enrichment observed for the pan-H3R2me2s, H3K4me3, and H3R2me2sK4me3 antibodies ( and (Matthews et al., 2007
)) suggested that H3R2me2s may be associated with H3K4me3. Indeed, this turns out to be true. We used antibodies to H3K4me3, pan-H3R2me2s, and H3R2me2sK4me3 to perform genome-wide localization analysis (ChIP-seq) in RAG2−/−
Abelson-transformed pro-B cells. In fact, H3R2me2s and H3R2me2sK4me3 both showed a remarkable genome-wide colocalization with H3K4me3. An example is shown for a gene-rich 350 kb region of murine chromosome 19, where these modifications showed very similar patterns of enrichment, generally localizing to the 5’ end of genes (). A closer look at the transcriptional start site (TSS) of a representative gene in this region (Dpf2) revealed that H3R2me2s and H3R2me2sK4me3 are both enriched just upstream and just downstream of the TSS, in a pattern that is nearly identical to the enrichment pattern of H3K4me3 ().
H3R2me2s, H3R2me2sK4me3, and H3K4me3 are colocalized throughout the mouse genome
The enrichment of H3R2me2s and H3R2me2sK4me3 flanking genic transcriptional start sites and the correlation of enrichment with gene expression appears to be general. We stratified all annotated mouse genes into four quartiles according to their expression levels in pro-B cells (Gene Expression Omnibus: GSE15330) (Ng et al., 2009
) and analyzed the signal intensity for pan-H3R2me2s, H3R2me2sK4me3, and H3K4me3 over a 4 kb window centered on the TSS of these genes. Consistent with previous findings (Barski et al., 2007
; Pan et al., 2007
), H3K4me3 is found in two peaks flanking the TSS and its enrichment is positively correlated with gene expression (, left panel). Nearly identical patterns were observed for H3R2me2s (, right panel) and H3R2me2sK4me3 (, center panel).
Consistent with the ChIP-qPCR results (), H3K4me3, H3R2me2s, and H3R2me2sK4me3 are tightly correlated at the IgH locus (), showing broad enrichment covering the region from DQ52 through the JH cluster, with very few sites of enrichment in the VH domain (Figure S2
The ChIP-seq results were further validated by qPCR of 60 randomly selected promoters (Table S2
) after immunoprecipitation of chromatin from the same RAG2−/−
Abelson-transformed pro-B cells with the pan-H3R2me2s, H3R2me2sK4me3, or H3K4me3 antibodies . The pair-wise combination shown in displays a strong positive correlation for all sites. Thus, H3R2me2s and H3K4me3 are tightly colocalized throughout the mouse genome.
H3R2me2s is conserved throughout evolution
To assess the evolutionary conservation of this modification, nuclear extracts from human (Homo sapiens - Hs), mouse (Mus musculus - Mm), frog (Xenopus laevis - Xl), fruit fly (Drosophila melanogaster - Dm), and budding yeast (Saccharomyces cerevisiae - Sc) cells were tested by Western blot analysis. A strong signal was observed for all of these organisms with both the pan-H3R2me2s and the H3R2me2sK4me3 antibodies (). Thus, H3R2me2s is conserved throughout evolution, as far back as budding yeast. Moreover, the strong signal observed with the H3R2me2sK4me3 antibody, which requires the presence of both modifications on the same histone tail, indicates that colocalization of H3R2me2s and H3K4me3 is conserved to some extent throughout evolution.
H3R2me2s exists in S. cerevisiae and is intimately connected to H3K4me3
H3R2me2s colocalizes with H3K4me3 in S. cerevisiae
We used previously published primer sets to interrogate the distribution pattern of these three modifications at the 5’ and 3’ ends of representative genes in S. cerevisiae
: highly transcribed (YLR340W), moderately transcribed (YPR112C and YLR342W), and inactive genes (YPL017C). We observed a striking correlation between the distribution patterns of H3R2me2s, H3K4me3, and H3R2me2sK4me3 (). All three modifications were present at both the 5’ and 3’ ends of the highly transcribed gene (left panel), enriched at the 5’ end of moderately transcribed genes (middle 2 panels), and poorly enriched at an inactive gene (right panel). The primer sets and genes chosen for this analysis are the same that were previously analyzed by ChIP-qPCR for H3R2me2a (Kirmizis et al., 2007
). Therefore, in S. cerevisiae
, H3R2me2s is colocalized with H3K4me3 and anti-correlated with H3R2me2a.
H3K4 is required for H3R2me2s deposition
Mutating arginine-2 of histone H3 to alanine (H3R2A) has been shown to completely abolish trimethylation of H3K4 (Kirmizis et al., 2007
). Given the tight correlation between H3R2me2s and H3K4me3, we asked whether the converse would be true. As shown in , lane 19 we found that no H3R2me2s was detected by Western blot when H3K4 was mutated to alanine (, lane 19). This was not simply due to an inability of the pan-H3R2me2s antibody to recognize its epitope when lysine 4 is mutated to alanine (). Thus H3K4, either unmethylated or in one of its methylated states, is required for H3R2me2s deposition.
Set1 is required for H3R2me2s deposition
In order to identify the methyltransferase responsible for depositing H3R2me2s, we first used a candidate gene approach, expressing shRNA to all known type II arginine methyltransferases in murine cells. Although levels of the methyltransferases were reduced, a reproducible loss of H3R2me2s modification was not obtained (Figure S3
). We then turned to S. cerevisiae
using Western blot analysis with the pan-H3R2me2s antibody to screen six yeast proteins that have a SET domain and nine proteins that contain putative SAM-binding domains (). Surprisingly, the set1Δ strain showed a complete loss of H3R2me2s signal (, lane 2). Since Set1 is the catalytic subunit of COMPASS, the yeast H3K4 methyltransferase, these results suggest two possibilities that are not mutually exclusive: either Set1 is also the catalytic subunit of the H3R2 symmetric dimethyltransferase, or H3K4 methylation is required for H3R2 symmetric dimethylation. The requirement of H3K4 for H3R2me2s deposition is consistent with either interpretation, as H3K4 is required for Set1 binding.
H3R2me2s deposition is greatly reduced in the absence of H3K4me3
Since Set1 is required for mono-, di-, and tri-methylation of H3K4, we analyzed two additional COMPASS mutants, first to confirm that COMPASS is required for H3R2me2s deposition, and second to determine whether a particular K4 methylation state is required. Loss of SWD3 (CPS30) – which destabilizes the COMPASS complex and causes the loss of all H3K4 methylation states – caused a complete loss of H3R2me2s (, lane 4). Since the swd3strain (, lane 4) exhibits the same phenotype as the set1Δ strain (, lane 2), we conclude that the loss of H3R2me2s in the set1Δ strain is likely due to loss of COMPASS activity. We then asked if the loss of SPP1 (CPS40), a COMPASS subunit required for the transition from H3K4me2 to H3K4me3, affected H3R2me2s levels. Loss of SPP1 caused a dramatic reduction in the levels of H3R2me2s (, lane 3). Therefore, either H3K4me3 is required for an H3R2 symmetric dimethyltransferase (distinct from COMPASS) to deposit H3R2me2s, or COMPASS is the H3R2 symmetric dimethyltransferase.
What are the functional roles of H3R2me2s?
The conservation of H3R2me2s and its colocalization with H3K4me3 raises a number of issues. Why are active promoters marked simultaneously by both H3R2me2s and H3K4me3, and what is the function of H3R2me2s? Perhaps H3R2me2s modifies specificities among the multiple H3K4me3-binding proteins. It has remained a puzzle how specificity is achieved when many different proteins recognize H3K4me3. Their ability to bind H3R2me2sK4me3 may be more variable, with the binding of some factors enhanced by the H3R2me2s modification (as with RAG2) while others merely tolerate its presence, and still others may be inhibited (Ramon-Maiques et al., 2007
). In this way, fine-tuning of target site recognition could be achieved, with the contribution of multiple interactions required to achieve the ultimate target site specificity. While RAG2 is the only protein currently known to preferentially bind H3R2me2sK4me3, this may simply reflect the novelty of this modification.
Of note, while we have shown that H3R2me2s and H3K4me3 coexist on individual histone tails at promoters, we cannot rule out the possibility that some nucleosomes contain only one or the other modification (perhaps having had one of the two modifications removed), and these nucleosomes could be recognized by different factors. However, the tight genome-wide correlation between H3R2me2s, H3K4me3, and H3R2me2sK4me3 enrichment levels argues that these two modifications are generally found together on the same histone tail.
An additional possible role for H3R2me2s is in the metabolism of H3K4me3. Previous work in yeast and humans has shown that H3K4me3 and H3R2me2a are mutually exclusive histone modifications (Guccione et al., 2007
; Hyllus et al., 2007
; Kirmizis et al., 2007
). Since symmetric dimethylation of H3R2 would preclude asymmetric dimethylation of this residue, H3R2me2s could facilitate or stabilize the trimethylation of H3K4 by protecting the H3K4me3 methyltransferase binding site from being occluded via asymmetric dimethylation of H3R2. H3R2me2s could also serve to maintain H3K4me3 by preventing demethylases from acting at H3K4. In any event, it is clear from their localization patterns that symmetric and asymmetric methylation of H3R2 serve distinct functions in the cell.
It is increasingly clear that there is a complex interplay between histone modifications. In some cases, one histone modification affects the ability to modify another residue on the same histone (Cheung et al., 2000
; Daujat et al., 2002
; Guccione et al., 2007
; Hyllus et al., 2007
; Kirmizis et al., 2007
; Lo et al., 2000
). In other cases, the modification of one histone affects the modification of another histone in the same nucleosome (Carrozza et al., 2005
; Dover et al., 2002
; Kim et al., 2009
; Ng et al., 2002
; Sun and Allis, 2002
). Histone modifications can also function to combinatorially regulate the binding of chromatin-associated proteins. For example, HP1 binds H3K9me3 only in the absence of H3S10p (Fischle et al., 2005
)). Modifications on different histone tails within the same mononucleosome can also regulate factor binding in cases where a single chromatin binding protein contains multiple histone recognition domains (e.g. BPTF (Ruthenburg et al., 2011
H3R2me2sK4me3 appears to provide another distinct example of histone crosstalk in which the two modifications influence each other’s deposition as well as subsequent factor binding. Here H3R2me2s and H3K4me3 are two nearby residues on the same histone tail that appear to always co-exist (though we cannot rule out the possibility that there are developmental or regulated states where they are separate). Thus, rather than affecting binding of a factor in a binary way (as with HP1 binding H3K9me3 but not H3K9me3S10p (Fischle et al., 2005
)), it appears that all H3K4me3 binders whose binding domains encompass H3R2 will be influenced by the modification state of H3R2. Moreover, either H3R2me2s and H3K4me3 are dependent on the same histone methyltransferase complex (see below) and/or the deposition of H3R2me2s is dependent on the prior deposition of H3K4me3. Thus, we have uncovered a striking new example of the complexity of histone crosstalk.
Interplay between H3K4 and H3R2 methylation
The tight correlation of H3R2me2s and H3K4me3 leads to the obvious questions: what role does H3R2 play in H3K4 trimethylation and what role does H3K4 trimethylation play in H3R2 symmetric dimethylation, what enzymatic machinery is responsible for depositing H3R2me2s, and how are the two events linked? As mentioned above, it is known that the presence of H3R2 is itself required for H3K4 trimethylation (Kirmizis et al., 2007
), an observation which we have independently confirmed. One simple model is that either H3R2me0 or H3R2me1 is required for SPP1 binding, which in turn is required for H3K4 trimethylation, followed by methylation of H3R2 on the same hsitone tail, either by COMPASS or by a distinct H3K4me3-dependent H3R2 methyltransferase. Alternatively, COMPASS could bind to H3R2, either in its unmodified or its mono-methylated form, and first catalyze the symmetric dimethylation of H3R2 and then the trimethylation of H3K4. Both of these models are consistent with our findings that Set1 and Spp1 are required for the generation of H3R2me2s, and the previous observations that H3K4me3 is lost in an H3R2A mutant yeast strains and that Spp1 is highly enriched at sites of H3K4me3 (and therefore, also H3R2me2s and H3R2me2sK4me3) and absent from regions enriched for H3R2me2a (Kirmizis et al., 2007
). These findings also underscore the yin-yang relationship between H3R2me2a and H3R2me2s.
It is worth noting that a number of yeast phenotypes associated with the mutation of H3R2 to alanine have been described. At present, it is impossible to determine whether these phenotypes, including the delayed activation of GAL genes and the loss of silencing in the HMR, HML, telomere and rDNA loci ,reflect the loss of H2R2me2s, H3R2me2a, or H3K4me3, or simply the loss of arginine.
In summary, the tight coupling of H3R2me2s and H3K4me3 is yet another example of the intricate interactions between histone modifications. The impetus for actively seeking evidence that H3R2me2s exists came from predictions based on our prior biochemical studies of the RAG2 PHD finger. We believe this is the first example of a histone modification being sought and identified based on structural and biochemical analyses of a histone recognition domain. The finding that active promoters are marked by H3R2me2sK4me3 will now lead to a rethinking of how the H3R2me2s modification impacts the various H3K4me3 binding proteins, and the importance of proteins like UHRF that solely recognize H3R2 uninfluenced by the H3K4 methylation status.