Search tips
Search criteria 


Logo of jbcThe Journal of Biological Chemistry
J Biol Chem. 2015 October 9; 290(41): 24902–24913.
Published online 2015 August 28. doi:  10.1074/jbc.M115.680389
PMCID: PMC4598999

Structural Basis for the Discriminative Recognition of N6-Methyladenosine RNA by the Human YT521-B Homology Domain Family of Proteins*


N6-Methyladenosine (m6A) is the most abundant internal modification in RNA and is specifically recognized by YT521-B homology (YTH) domain-containing proteins. Recently we reported that YTHDC1 prefers guanosine and disfavors adenosine at the position preceding the m6A nucleotide in RNA and preferentially binds to the GG(m6A)C sequence. Now we systematically characterized the binding affinities of the YTH domains of three other human proteins and yeast YTH domain protein Pho92 and determined the crystal structures of the YTH domains of human YTHDF1 and yeast Pho92 in complex with a 5-mer m6A RNA, respectively. Our binding and structural data revealed that the YTH domain used a conserved aromatic cage to recognize m6A. Nevertheless, none of these YTH domains, except YTHDC1, display sequence selectivity at the position preceding the m6A modification. Structural comparison of these different YTH domains revealed that among those, only YTHDC1 harbors a distinctly selective binding pocket for the nucleotide preceding the m6A nucleotide.

Keywords: crystal structure, isothermal titration calorimetry (ITC), RNA binding protein, RNA methylation, RNA-protein interaction, N6-methyladenosine


Chromatin modifications, including histone modifications and DNA methylation, play a critical role in regulating gene transcription in eukaryotes (1, 2). Like chromatin modifications, RNA can also be modified, and more than 100 RNA modifications have been reported (3). Among them, N6-methyladenosine (m6A)3 is the most abundant internal modification in eukaryotic RNAs (4), and its distribution and functions are just beginning to be appreciated with the aid of recently developed high throughput sequencing technologies (5,8). In mRNA, m6A is found to be enriched in the 3′-UTR and stop codons, suggesting a role in the regulation of gene expression at the post-transcriptional level (6).

Although m6A had been identified in mammals, as well as in prokaryotes and viruses, in the 1970s (9,11), its function had remained largely unknown until recently when its writer, erasers, and readers were identified. Three groups independently reported that the METTL3-METTL14 complex catalyzes the N6-adenosine methylation with high efficiency (7, 12, 13), following the original discovery in 1997 that METTL3 is an N6-adenosine methyltransferase (14). Mutation of the METTL3 ortholog in zebrafish leads to developmental defects and apoptosis (13). FTO (15) and ALKBH5 (16, 17) were found to catalyze the demethylation of m6A in single-stranded RNA. FTO is associated with obesity (18), whereas deficiency of ALKBH5 impairs normal spermatogenesis in mice (16, 17). The Schizosaccharomyces pombe YT521-B homology (YTH) domain-containing protein Mmi1 was reported to be an RNA binding protein and responsible for the selective elimination of meiosis-specific transcripts during vegetative growth (19). Later on, several groups reported that the YTH domain-containing proteins specifically bind m6A RNA (5, 8, 20) and that YTHDF2 destabilizes its targeted mRNA by its interaction with m6A (20). The only known YTH domain-containing protein in Saccharomyces cerevisiae Pho92 also decreases the Pho4 mRNA stability by binding to its 3′-UTR during the phosphate metabolism (21). Very recently, it was reported that the association of YTHDF1 with m6A RNA regulates the mRNA translation efficiency (22).

Recently, several groups presented the structures of YTH domains, including the YTH domain of Zygosaccharomyces rouxii MRB1 with a 7-mer m6ARNA (23), the YTH domain of human YTHDC1 with m6A RNA (24), and the YTH domain of YTHDF2 in its apo form (25) and with the m6A nucleotide (26). In the YTH domain complex structures, the m6A base is positioned in a hydrophobic pocket consisting of two or three aromatic residues. The m6A binding mode is reminiscent of the methyllysine and methylarginine binding mode (27). Furthermore, our structural and binding studies of YTHDC1 also established that YTHDC1 slightly prefers a G nucleotide and disfavors an A nucleotide at the position preceding the m6A nucleotide in RNA (24). In the human genome, there are at least five YTH domain-containing proteins, and we wonder whether these YTH domains have distinct sequence selectivity, similar to the Royal family or PHD domain histone readers (27).

In this study, we systematically characterized the m6A binding ability of four human YTH domains and the YTH domain of yeast Pho92 and determined the crystal structures of the YTH domain of human YTHDF1 and its complex with a 5-mer m6A RNA, and the YTH domain of yeast Pho92 in complex with a 5-mer m6A RNA. Our results indicate that the YTH domains recognized the m6A nucleotide in a conserved mode. Whereas YTHDC1 preferentially bound to the GG(m6A)C sequence (24), the other YTH domains bound to m6A RNA regardless of sequence context. Structural comparison of the YTHDC1 and YTHDF1 complexes revealed that YTHDC1 and YTHDF1 used different binding pockets to accommodate the nucleotide preceding m6A, corresponding to divergent binding grooves for accommodating RNA 5′ to m6A. Our structural and mutagenesis studies also pinpointed the key residues responsible for the differential sequence selectivity.

Experimental Procedures

Cloning, Expression, and Purification of the YTH Domains of YTHDF1, YTHDF2, YTHDC2, and Pho92

The Pho92 YTH domain (amino acids 141–306), two constructs of the human YTHDF1 YTH domain (amino acids 361–559 and amino acids 365–554), the YTHDF2 YTH domain (amino acids 380–579), and MJECL36 (amino acids 1–147) were subcloned into a pET28a-MHL vector. All recombinant proteins were overexpressed at 18 °C as N-terminal His6-tagged protein in Escherichia coli BL21 (DE3) codon plus RIL (Stratagene) and purified by HiTrap nickel column. The His tag was removed by the addition of 0.05 mg of TEV protease per milligram of recombinant protein, followed by the dialysis to remove imidazole at 4 °C for 12 h. The samples were then passed through a nickel-nitrilotriacetic acid column and further purified by size exclusion chromatography (Superdex 75; GE Healthcare). The mutants of the Pho92 and YTHDF1 YTH domains were cloned using site-directed mutagenesis kits (Invitrogen) and were expressed and purified in the same way as wild type recombinant proteins.

Crystallization, Data Collection, and Structure Determination

All crystals were obtained using sitting drop vapor diffusion at 20 °C. Crystals of the YTHDF1 YTH domain (361–559) were obtained by mixing 1.0 μl of purified protein (15 mg/ml) with 1.0 μl of reservoir solution containing 0.2 m potassium thiocyanate and 25% PEG 3350 and subsequent equilibration against 800 μl of reservoir buffer. The purified YTHDF1 YTH domain (365–554, 12 mg/ml) was mixed with the 5-mer GG(m6A)CU RNA in a ratio of 1:2 and incubated for 30 min at 0 °C before co-crystallization. Crystals of the complex were obtained by mixing 1.0 μl of the complex with 1.0 μl of reservoir solution containing 0.1 m Bis-Tris, pH 6.5, 0.2 m sodium chloride, and 25% PEG 3350 and subsequent equilibration against 800 μl of reservoir buffer. The purified Pho92 YTH domain (141–306, 10 mg/ml) was mixed with the 5-mer UG(m6A)CU RNA in a ratio of 1:2 and incubated for 30 min at 0 °C before co-crystallization. A crystal of the complex was obtained by mixing 1.0 μl of the complex with 1.0 μl of well solution containing 0.2 m potassium isocyanate and 20% PEG 3350 and subsequent equilibration against 800 μl of reservoir buffer.

Diffraction intensities were recorded under sample cooling to 100 K at rotating anode or synchrotron sources (see Table 2) and processed with XDS (28) and AIMLESS (29, 30). Molecular replacement was performed with the program PHASER (31). Model refinement was performed in iterations of restrained refinement, geometry validation, interactive building withREFMAC (32), MOLPROBITY (33, 34), and COOT (35), respectively. PDB deposition data were compiled with PDB_EXTRACT (36), and summarizing statistics were extracted with the IOTBX (37) software library. Omit maps of the oligonucleotide complexes were calculated with PHENIX (33).

Data collection and refinement statistics

Specifically, the YTHDF1 “apo” structure was solved by molecular replacement with coordinates related to PDB entry 4R3I. PARROT (38) was used for phase improvement, and BUCCANEER (39) was used for automated model building. The preliminary model was further refined against higher resolution diffraction data from another, isomorphous crystal.

The YTHDF1 oligoribonucleotide complex structure was solved by molecular replacement with coordinates from the YTHDF1 “apo” structure. Some nucleotide link restraints were prepared with JLIGAND (40).

The Pho92 structure was solved by molecular replacement with an ensemble of “apo” models of YTHDC1 (related to PDB entry 4R3H) and YTHDF1 (see above). Density improvement was performed with ARP/WARP (41) and PARROT, automated model building with ARP/wARP (42).

Isothermal Titration Calorimetry (ITC) Experiments

All ITC experiments were performed at 298 K using a MicroCal ITC200 (GE Healthcare). All RNAs used for ITC experiments were purchased from Thermo Fisher Scientific except the unmodified GGACU, which was purchased from Integrated DNA Technologies, Inc. The purity of all purchased RNAs was >90%. All proteins and RNAs were dialyzed or dissolved into the same buffer containing 20 mm Tris, pH 7.5, 150 mm NaCl before the binding experiments. 10–17 injections were recorded by injecting 2 μl of 500–1000 μm of RNAs into a sample well containing 15–60 μm of protein. The concentration of the purified proteins and RNA oligonucleotides were estimated with absorbance spectroscopy (NanoDrop) using the extinction coefficients, A280 and A260, respectively. Each binding isotherm was plotted, analyzed, and fitted in a one-site binding model by Origin Software (MicroCal Inc.) after subtraction of the respective control.

Results and Discussion

YTH Domain Is an Evolutionarily Conserved m6A Reader

The m6A modification has been found from yeast to human, and recent m6A transcriptome analysis reveals that m6A is dominantly present in the RRACU (where R = A/G) consensus motif in mammals (5) and in the RGAC (where R = A/G) consensus motif across yeast species (8). Consistently, a previous study using the borate gel chromatography reveals that the m6A modification occurs within a G(m6A)C or A(m6A)C motif with probabilities of 70 and 30% in mammals, respectively (43). It was reported that both human YTHDF2 and yeast Pho92 (or MRB1) are m6A binding proteins (5, 8, 20), and YTHDF2 prefers a core motif of G(m6A)C (20). In the human genome, there are at least five YTH domain-containing proteins, and our recent structural and biochemical studies showed that YTHDC1 utilizes an aromatic cage to recognize m6A, and it preferentially recognizes the GG(m6A)C sequence somewhat (24). To understand the binding specificity and sequence selectivity of other YTH domain-containing proteins, we cloned the YTH domains of YTHDF1/2, YTHDC2, and S. cerevisiae Pho92 for quantitative binding and structural studies. Our ITC binding results show that both the human and yeast YTH domains recognized m6A-containing RNA but not unmodified RNA, regardless of the RNA length (Table 1), implying that the YTH domain is an evolutionarily conserved m6A-dependent RNA binding domain. Furthermore, the YTHDF1/2, YTHDC2, and Pho92 YTH domains did not display sequence selectivity at the −1 position (the position preceding the m6A nucleotide), unlike the YTHDC1 YTH domain, which preferred GG(m6A)CU over GA(m6A)CU containing 9-mer RNA (Table 1; see Fig. 4, A–D), consistent with our previous findings using 5-mer RNAs or 16-mer RNAs (24). In addition, the YTH domains bound to the 9-mer m6A RNAs with similar binding affinities to the 16-mer m6A RNAs, but much weaker to the 5-mer GG(m6A)CU RNA (Table 1), suggesting that the immediate surrounding nucleotides of m6A also contribute to the YTH domain binding.

Binding affinities of different YTH domains to RNAs
YTHDF1 specifically recognizes m6A RNA without sequence selectivity. A–D, the representative ITC binding curves of YTHDF1 binding to different 16-mer m6A RNAs. A, 16-mer GG(m6A)CU. B, 16-mer GA(m6A)CU. C, 16-mer GC(m6A)CU. D, 16-mer GU(m6A)CU. ...

Crystal Structure of the YTHDF1 YTH Domain

To provide structural insights into the m6A specific recognition and the different sequence selectivity of the YTH domains, we determined the crystal structure of YTHDF1 (amino acids 361–559) at a resolution of 1.97 Å (Table 2). The YHDF1 YTH domain adopted a similar architecture to that of the YTHDC1 YTH domain (24), and the root mean square deviation between the backbone Cα atoms of the YTH domains of YTHDF1 and YTHDC1 was 0.9 Å (calculated from PyMOL software), although the sequence identity between the two YTH domains was only 27%. The YTHDF1 YTH domain consisted of five α helices (α0–α4), six β strands (β1–β6), and one 310 helix following the β5 strand (Fig. 1A). The six β strands were arranged in an atypical β barrel fold in the order of β6-β1-β3-β4-β5-β2. The only parallel β strands were β1 and β3, whereas the others were anti-parallel (Fig. 1, B and C). Three helices (α1–α3) packed against the β barrel and constituted a hydrophobic core together with the six β strands. The α1 helix also packed against α0 and α4 (Fig. 1B). The α0 helix was followed by a long loop linker, whereas the α4 was a kink helix with its axis perpendicular to that of α1 (Fig. 1B).

Overall structure of the YTHDF1 YTH domain. A, sequence alignment of the YTH domains of human YTHDF1-3, YTHDC1-2, and yeast Pho92 by ESPript 3. B, overall structure of the YTHDF1 YTH domain (amino acids 361–559) shown in orange cartoon. The invisible ...

m6A Is Specifically Recognized by an Aromatic Cage of YTHDF1

We attempted co-crystallization of the human YTHDF1 YTH domain with m6A-containing RNAs of different lengths, but only obtained crystals from the complex with the 5-mer GG(m6A)CU RNA, which diffracted to 1.60 Å resolution (Table 2). In the complex structure, nucleotides 5′-GG(m6A) exhibited density for both the backbone and nucleobases. Additional density was interpreted as representing nucleotide (C+1) and the phosphate of (U+2)-3′ (Fig. 2E). The m6A RNA bound to a positively charged concave of YTHDF1 in an extended confirmation (Fig. 2, A and B). The m6A binding pocket of the YTHDF1 YTH domain was similar to that of the reported YTHDC1 YTH domain (24) and was composed of the C termini of β1, α1, β2, the N terminus of α2, and the loop between β4 and β5 (Fig. 2A). Specifically, the m6A was accommodated in a pocket composed of Trp411, Trp465, and Trp470, with the ring planes of Trp411 and Trp470 parallel to each other and perpendicular to the ring plane of Trp465 (Fig. 3, bottom right panel). The N6-methyl bond was coplanar with the m6A purine ring, with the methyl group pointing toward Trp465 and the m6A adenine moiety sandwiched by the rings of Trp411 and Trp470 (Figs. 3, bottom right panel, and and44E). The CH-π interaction between the N6-methyl moiety and the aromatic cage, together with the π-π interactions between the adenine base and the aromatic residues, constituted the basis of the m6A specific recognition (Fig. 3, bottom right panel). Additionally, the m6A adenine formed hydrogen bonds with the YTHDF1 YTH domain, including the N3 of m6A and the backbone NH of Tyr397, and the N6 of m6A and the backbone carbonyl of Cys412 (Figs. 3, bottom right panel, and and44E). The side chain oxygen of Asp401 was located in close proximity of the N1 of m6A (Figs. 3, bottom right panel, and and44E).

Complex structures of the YTHDF1 YTH domain with GG(m6A)CU and the Pho92 YTH domain with UG(m6A)CU. A, overall structure of the YTHDF1 YTH domain (amino acids 365–554) with a 5-mer RNA GG(m6A)CU. The four visible nucleotides (GG(m6A)C) are shown ...
Detailed interactions between the YTH domain of YTHDF1 (orange) and the 5-mer RNA −2GG(m6A)CU+2 (green). Top left panel, overall view. Nucleotides and residues involved in binding to RNA are shown in stick mode. The intermolecular hydrogen bonds ...

To further evaluate the role of the m6A interacting residues of YTHDF1 in binding, we performed mutagenesis and ITC binding experiments. Mutating Trp411, Trp465, or Trp470 to an alanine disrupted the binding of YTHDF1 to the 5-mer GG(m6A)CU RNA, confirming that the aromatic residues were critical for the m6A specific recognition (Table 3).

Binding affinities of different RNA constructs to the wild type and mutants of the YTH domains of YTHDC1, YTHDF1, and Pho92

The Binding Site of the YTHDF1 YTH Domain for the Nucleotide Preceding m6A Is Distinct from That Found in the YTHDC1 YTH Domain

In addition to the m6A specific binding, the m6A RNA also made contacts with the YTHDF1 YTH domain through other nucleotides (Fig. 3, top left panel). To understand the different sequence selectivity of the YTH domains of YTHDC1 and YTHDF1 toward the −1 position of the m6A RNA, we superimposed the crystal structures of YTHDC1 and YTHDF1, respectively, in complex with the 5-mer GG(m6A)CU RNA and found that only the 3′ ends including m6A of the two RNAs coincided, whereas the 5′ ends of the two RNAs deviated (Fig. 5A). In the YTHDF1 complex, G-1 was sandwiched between G-2 and the ring of Tyr397 (Fig. 3, bottom left panel). Tyr397 is conserved in YTHDF1/2/3, and mutating Tyr397 to alanine significantly reduced its binding affinity to both the 5-mer and 16-mer m6A RNA (Table 3). Whereas Tyr397 was important to m6A RNA binding, it appeared able to stack with any nucleotide via π-π interactions, consistent with our binding data of similar affinities of the m6A RNA with different substitutions at the −1 position (Table 1). In the previously published YTHDC1 complex, on the other hand, G-1 interacts with Leu380 and Met438 and forms a direct H-bond with the backbone NH of Val382 and a solvent-mediated H-bond with the side chain of Asn383 (24) (Fig. 5B). Specifically, replacing the G with an adenosine at the −1 position would disrupt the Val382 hydrogen bond or introduce potential steric clashes with Val382, which explains why YTHDC1 disfavors an A nucleotide at the −1 position in the m6A RNA (24). In addition, neither Met438 nor Leu380 is conserved in other YTH domains except YTHDC1 (Figs. 1A and and66A), and mutating either of them in YTHDC1 would diminish its binding to the m6A RNA and attenuate the nucleotide preference at the −1 position for YTHDC1 (Table 3). Therefore, YTHDC1 and YTHDF1 used different binding pockets to accommodate the nucleotide preceding m6A, which leads to different nucleotide selectivity at the −1 position (Figs. 3, bottom left panel, and and55B).

YTHDC1 has a selective binding pocket at the −1 position to accommodate the nucleotide preceding m6A, distinct from other YTH domains.A, superposition of the crystal structures of the YTHDF1 (orange cartoon)-GG(m6A)CU (green cartoon) and YTHC1 ...
Structure comparison betweenYTHDF1, Pho92, and YTHDC2 (PDB entry 2YU6). A, sequence alignment of the YTH domains of human YTHDC1, YTHDC2, and YTHDF1-3. The secondary structures of the YTHDC1 YTH domain are shown at the top of the sequences according to ...

Other Interactions between YTHDF1 and m6A RNA

The base of C+1 in the YTHDF1-GG(m6A)CU complex could stack with the side chain of Arg506 based on the weak electron density. In addition, the 5′-phosphate of C+1 formed a hydrogen bond with the backbone of Asp507 and could interact electrostatically with the side chain of Arg506. Arg506 is highly conserved in the YTH domain (Fig. 1A), and mutating Arg506 to an alanine diminished the binding of YTHDF1 to the 5-mer RNA (Table 3). The 5′-phosphate moiety of U+2 formed a hydrogen bond with the backbone NH of Gly442 and the side chain of Lys395 (Fig. 3, top right panel). Except m6A, no base-specific interactions were observed in the YTHDF1 complex. Nevertheless, the nucleotides surrounding m6A made significant contributions to the m6A RNA binding and deleting U+2 and G-2 significantly reduced binding of m6A RNA (Table 3).

Crystal Structure of Yeast Pho92 with a 5-mer m6A RNA Reveals a Conserved m6A Binding Pocket

To explore whether the m6A binding mode of the YTH domain is also conserved in yeast, we determined the crystal structure of the S. cerevisiae Pho92 YTH domain (141–306) in complex with a 5-mer m6A RNA at a resolution of 1.80 Å (Table 2). We only located m6A of the 5-mer RNA in the electron density map (Fig. 2F). Overall, the Pho92 YTH domain adopted the canonical YTH fold similar to those of the human YTH domains, consisting of three α helices and six β strands (Fig. 1A). The root mean square deviation between the YTH domains of Pho92 and YTHDF1 was 1.4 Å (44) at a sequence identity between YTH domains of 35% (Figs. 1A; ;2,2, C and D; and and66B). Despite the overall similar architectures between the two structures, the Pho92 YTH domain did not contain the helices α0 and α4 at the N and C termini but contained long loops in their places, respectively. Both loops packed against β5 (Figs. 2, C and D, and and66B).

In the Pho92-m6A complex structure, m6A was accommodated in an aromatic cage consisting of Trp177, Trp231, and Tyr237 in a similar way to that of YTHDF1 (Figs. 4G and and66C). Therefore, the YTH domain used a evolutionarily conserved aromatic cage to recognize m6A, and mutating the cage residues Trp177 and Trp231 or the C+1 binding residue Arg273 of Pho92 to an alanine would diminish or disrupt its binding (Table 3).

Sequence alignment between the YTH domains of Pho92 and YTHDC1 showed that the nucleotide binding pocket at −1 position of YTHDC1 was not conserved in Pho92 (Figs. 5D and and6,6, A and D). Like YTHDF1, the Pho92 YTH domain bound to m6A RNA without obvious sequence preference (Table 1). However, Tyr397 of YTHDF1 corresponded to Ser163 in Pho92 (Figs. 1A, ,33D, and and66A); thus Pho92 would potentially recognize the nucleotide at −1 position with a pocket different from either that of YTHDC1 (Fig. 5D) (24) or YTHDF1 (Fig. 3, bottom left panel).

N1 Atom of m6A Nucleotide Favors an Asn over an Asp in the Aromatic Cage Pocket

When comparing the m6A binding pocket of YTHDF1 with that of YTHDC1, we found the Asn367 in YTDC1 is replaced by an Asp (Asp401) in YTHDF1. m6A is neutral in charge. Assuming the N1 atom of m6A in the “free” rather than protonated state (45), deprotonation of Asp401 in YTHDF1 should weaken binding. On the other hand, mutating Asp401 to an asparagine should enhance the binding by introducing a potential hydrogen bond donor to complement “free” N1 of m6A (Fig. 4, E and F). Accordingly, we found the YTHDF1 D401N mutant binds the GG(m6A)CU 5-mer RNA 16-fold stronger than its wild type (Table 3 and Fig. 4, E and F). Furthermore, we mutate the Asn367 of YTHDC1 to an Asp and found the N367D mutant disrupts the binding of the GG(m6A)CU 5-mer RNA (Table 3). Together, we found that the key residue difference could well explain the differential binding ability of the YTH domains toward the m6A RNA.

From the sequence alignment of human YTH domain-containing proteins and yeast Pho92, we found that two of the three aromatic cage residues (such as Trp411 and Trp465 in YTHDF1 and Trp177 and Trp231 in Pho92) were absolutely conserved in all YTH domains (Fig. 1A). In contrast, Trp470 of YTHDF1 (or Tyr237 of Pho92) corresponds to a leucine residue in YTHDC1/2 (Figs. 1A and and66C).

The Fold of the YTH Domain Is Similar to That of the EVE Domain

A structural similarity query on the Fatcat server (46) revealed that the YTH domain exhibits a similar fold to that of the EVE domain (47). Among the experimental EVE domain structures available from the PDB (48), the structure of MJECL36 from Methanocaldococcus jannaschii (PDB entry 2P5D) was the closest to the structure of the YTH domain (47). YTHDF1 and MJECL36 had an root mean square deviation of 3.2 Å when we aligned the backbone Cα atoms of these two structures (Fig. 7A). Interestingly, MJECL36 also harbored a similar aromatic cage comprised of Trp25, Phe79, and Phe90 (Fig. 7B). Despite their structural similarity, we found that the backbone of Asn12 in MJECL36 deviated from that of Tyr397 of YTHDF1, which might disrupt the base specific hydrogen bond with N3 of m6A (Fig. 7B). Consistently, our ITC results indicated that MJECL36 did not bind to the 5-mer GG(m6A)CU RNA (Table 3).

The YTH fold is similar to that of the EVE domain in prokaryotes.A, superposition of the YTHDF1(orange cartoon)-GG(m6A)CU (green cartoon) complex and MJECL36 (PDB entry 2P5D, blue cartoon). The m6A nucleotide is shown as green sticks. B, MJECL36 (PDB ...

Implications for Division of Labor among the YTH Domain Proteins

The subject of the present study, m6A, is the most abundant internal modification underlying various eukaryotic RNAs, including mRNAs, tRNAs, and long noncoding RNAs (49), a reversible epigenetic modification and regulator of RNA metabolism (50). The recent m6A transcriptome (8) analysis reveals that m6A is usually present within the GAC or AAC sequence motifs in mammals, and consistently these motifs are also the substrates for the only known m6A methyltransferase METTL3-METTL14 complex (7, 8, 12, 13). Yeast and mammalian YTH domain-containing proteins have been reported as the readers of the m6A mark (5, 8, 20).

We previously noted that the YTH domain of YTHDC1 harbors a binding pocket that favors a guanosine nucleotide at the −1 position (24). The corresponding pockets of other YTH proteins do not structurally support this selectivity (Figs. 5, B and D, and and6,6, D and E), consistent with no sequence selectivity at the −1 position in non-YTHDC1 proteins (Table 1). On the other hand, YTHDF2 was reported to bind a G(m6A)C motif from a RNA pulldown assay using the total RNA transcripts from the cells (20). The YTH domain of YTHDF2 is 87% identical to that of YTHDF1 in sequence, and all the critical residues in m6A recognition are conserved between YTHDF1 and YTHDF2 (Fig. 1A). Therefore, they should bind a very similar sequence motif, which was confirmed by our ITC binding results (Table 1), i.e. both YTHDF1 and YTHDF2 are m6A specific binders without sequence selectivity at the −1 position (Table 1). One possible explanation for the different motifs obtained from the two different binding assays is: whereas YTHDF1 and YTHDF2 are able to bind to any m6A-containing RNA in vitro, the only known m6A methyltransferase complex, METTL3-METTL14, selectively catalyzes the methylation of the adenosine in a G(m6A)C (70%) or A(m6A)C (30%) context in vivo (7, 12, 13). This would be consistent with a bias toward the G(m6A)C when using the total RNA transcripts from the cells in the RNA pulldown assay (20). On the basis of our binding and structural studies, we propose that the preferred physiological ligands of YTHDC1 could be G(m6A)C RNAs. The other human YTH domains could recognize both G(m6A)C and A(m6A)C RNAs as their ligands.

Author Contributions

C. X. and J. M. conceived the research. C. X. cloned, expressed, and purified all YTH domain proteins and their mutants listed in the manuscript with the assistance from K. L. and P. L. C. X. performed extensive structural experiments and analyzed the data. C. X. and K. L. performed ITC binding experiments. C.X. and J. M. wrote the manuscript. All authors discussed the manuscript. H. A. and M. S contributed to the analyzing and discussion of the binding characteristics of YTH domain proteins. J. M. supervised the project.


We thank Dr. Wolfram Tempel and Dr. John R. Walker for the assistance with data collection and structure determination and for reviewing the YTHDF1 crystal structure. We also thank Dr. Cheryl Arrowsmith and Dr. Adelinda Yee for kindly providing the Methanocaldococcus jannaschii DNA and yeast DNA.

*The SGC is a registered charity (number 1097737) that receives funds from AbbVie, Bayer Pharma AG, Boehringer Ingelheim, Canada Foundation for Innovation, Eshelman Institute for Innovation, Genome Canada, Innovative Medicines Initiative (the European Federation of Pharmaceutical Industries and Associations (European Union)) under ULTRA-DD Grant 115766, Janssen, Merck & Co., Novartis Pharma AG, Ontario Ministry of Economic Development and Innovation, Pfizer, Sao Paulo Research Foundation-Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), Takeda, and the Wellcome Trust.

The atomic coordinates and structure factors (codes 4RCI, 4RCJ, and 4RCM) have been deposited in the Protein Data Bank (

3The abbreviations used are:

YT521-B homology
isothermal titration calorimetry
Protein Data Bank.


1. Kouzarides T. (2007) Chromatin modifications and their function. Cell 128, 693–705 [PubMed]
2. Wu H., Zhang Y. (2014) Reversing DNA methylation: mechanisms, genomics, and biological functions. Cell 156, 45–68 [PMC free article] [PubMed]
3. Machnicka M. A., Milanowska K., Osman Oglou O., Purta E., Kurkowska M., Olchowik A., Januszewski W., Kalinowski S., Dunin-Horkawicz S., Rother K. M., Helm M., Bujnicki J. M., Grosjean H. (2013) MODOMICS: a database of RNA modification pathways: 2013 update. Nucleic Acids Res. 41, D262–D267 [PMC free article] [PubMed]
4. Bokar J. A. (2005) Fine-tuning of RNA Functions by Modification and Editing (Grosjean H., editor. , ed) pp. 141–177, Springer-Verlag, Berlin
5. Dominissini D., Moshitch-Moshkovitz S., Schwartz S., Salmon-Divon M., Ungar L., Osenberg S., Cesarkas K., Jacob-Hirsch J., Amariglio N., Kupiec M., Sorek R., Rechavi G. (2012) Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature 485, 201–206 [PubMed]
6. Meyer K. D., Saletore Y., Zumbo P., Elemento O., Mason C. E., Jaffrey S. R. (2012) Comprehensive analysis of mRNA methylation reveals enrichment in 3′ UTRs and near stop codons. Cell 149, 1635–1646 [PMC free article] [PubMed]
7. Wang Y., Li Y., Toth J. I., Petroski M. D., Zhang Z., Zhao J. C. (2014) N6-Methyladenosine modification destabilizes developmental regulators in embryonic stem cells. Nat. Cell Biol. 16, 191–198 [PMC free article] [PubMed]
8. Schwartz S., Agarwala S. D., Mumbach M. R., Jovanovic M., Mertins P., Shishkin A., Tabach Y., Mikkelsen T. S., Satija R., Ruvkun G., Carr S. A., Lander E. S., Fink G. R., Regev A. (2013) High-resolution mapping reveals a conserved, widespread, dynamic mRNA methylation program in yeast meiosis. Cell 155, 1409–1421 [PMC free article] [PubMed]
9. Dubin D. T., Taylor R. H. (1975) The methylation state of poly A-containing messenger RNA from cultured hamster cells. Nucleic Acids Res. 2, 1653–1668 [PMC free article] [PubMed]
10. Wei C. M., Gershowitz A., Moss B. (1975) Methylated nucleotides block 5′ terminus of HeLa cell messenger RNA. Cell 4, 379–386 [PubMed]
11. Perry R. P., Kelley D. E., Friderici K., Rottman F. (1975) The methylated constituents of L cell messenger RNA: evidence for an unusual cluster at the 5′ terminus. Cell 4, 387–394 [PubMed]
12. Liu J., Yue Y., Han D., Wang X., Fu Y., Zhang L., Jia G., Yu M., Lu Z., Deng X., Dai Q., Chen W., He C. (2014) A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation. Nat. Chem. Biol. 10, 93–95 [PMC free article] [PubMed]
13. Ping X. L., Sun B. F., Wang L., Xiao W., Yang X., Wang W. J., Adhikari S., Shi Y., Lv Y., Chen Y. S., Zhao X., Li A., Yang Y., Dahal U., Lou X. M., Liu X., Huang J., Yuan W. P., Zhu X. F., Cheng T., Zhao Y. L., Wang X., Rendtlew Danielsen J. M., Liu F., Yang Y. G. (2014) Mammalian WTAP is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. Cell Res. 24, 177–189 [PMC free article] [PubMed]
14. Bokar J. A., Shambaugh M. E., Polayes D., Matera A. G., Rottman F. M. (1997) Purification and cDNA cloning of the AdoMet-binding subunit of the human mRNA (N6-adenosine)-methyltransferase. RNA 3, 1233–1247 [PubMed]
15. Jia G., Fu Y., Zhao X., Dai Q., Zheng G., Yang Y., Yi C., Lindahl T., Pan T., Yang Y. G., He C. (2011) N6-Methyladenosine in nuclear RNA is a major substrate of the obesity-associated FTO. Nat. Chem. Biol. 7, 885–887 [PMC free article] [PubMed]
16. Xu C., Liu K., Tempel W., Demetriades M., Aik W., Schofield C. J., Min J. (2014) Structures of human ALKBH5 demethylase reveal a unique binding mode for specific single-stranded N6-methyladenosine RNA demethylation. J. Biol. Chem. 289, 17299–17311 [PMC free article] [PubMed]
17. Zheng G., Dahl J. A., Niu Y., Fedorcsak P., Huang C. M., Li C. J., Vågbø C. B., Shi Y., Wang W. L., Song S. H., Lu Z., Bosmans R. P., Dai Q., Hao Y. J., Yang X., Zhao W. M., Tong W. M., Wang X. J., Bogdan F., Furu K., Fu Y., Jia G., Zhao X., Liu J., Krokan H. E., Klungland A., Yang Y. G., He C. (2013) ALKBH5 is a mammalian RNA demethylase that impacts RNA metabolism and mouse fertility. Mol. Cell 49, 18–29 [PMC free article] [PubMed]
18. Frayling T. M., Timpson N. J., Weedon M. N., Zeggini E., Freathy R. M., Lindgren C. M., Perry J. R., Elliott K. S., Lango H., Rayner N. W., Shields B., Harries L. W., Barrett J. C., Ellard S., Groves C. J., Knight B., Patch A. M., Ness A. R., Ebrahim S., Lawlor D. A., Ring S. M., Ben-Shlomo Y., Jarvelin M. R., Sovio U., Bennett A. J., Melzer D., Ferrucci L., Loos R. J., Barroso I., Wareham N. J., Karpe F., Owen K. R., Cardon L. R., Walker M., Hitman G. A., Palmer C. N., Doney A. S., Morris A. D., Smith G. D., Hattersley A. T., McCarthy M. I. (2007) A common variant in the FTO gene is associated with body mass index and predisposes to childhood and adult obesity. Science 316, 889–894 [PMC free article] [PubMed]
19. Harigaya Y., Tanaka H., Yamanaka S., Tanaka K., Watanabe Y., Tsutsumi C., Chikashige Y., Hiraoka Y., Yamashita A., Yamamoto M. (2006) Selective elimination of messenger RNA prevents an incidence of untimely meiosis. Nature 442, 45–50 [PubMed]
20. Wang X., Lu Z., Gomez A., Hon G. C., Yue Y., Han D., Fu Y., Parisien M., Dai Q., Jia G., Ren B., Pan T., He C. (2014) N6-Methyladenosine-dependent regulation of messenger RNA stability. Nature 505, 117–120 [PMC free article] [PubMed]
21. Kang H. J., Jeong S. J., Kim K. N., Baek I. J., Chang M., Kang C. M., Park Y. S., Yun C. W. (2014) A novel protein, Pho92, has a conserved YTH domain and regulates phosphate metabolism by decreasing the mRNA stability of PHO4 in Saccharomyces cerevisiae. Biochem. J. 457, 391–400 [PubMed]
22. Wang X., Zhao B. S., Roundtree I. A., Lu Z., Han D., Ma H., Weng X., Chen K., Shi H., He C. (2015) N6-Methyladenosine modulates messenger RNA translation efficiency. Cell 161, 1388–1399 [PMC free article] [PubMed]
23. Luo S., Tong L. (2014) Molecular basis for the recognition of methylated adenines in RNA by the eukaryotic YTH domain. Proc. Natl. Acad. Sci. U.S.A. 111, 13834–13839 [PubMed]
24. Xu C., Wang X., Liu K., Roundtree I. A., Tempel W., Li Y., Lu Z., He C., Min J. (2014) Structural basis for selective binding of mA RNA by the YTHDC1 YTH domain. Nat. Chem. Biol. 10, 927–929 [PubMed]
25. Zhu T., Roundtree I. A., Wang P., Wang X., Wang L., Sun C., Tian Y., Li J., He C., Xu Y. (2014) Crystal structure of the YTH domain of YTHDF2 reveals mechanism for recognition of N6-methyladenosine. Cell Res. 24, 1493–1496 [PMC free article] [PubMed]
26. Li F., Zhao D., Wu J., Shi Y. (2014) Structure of the YTH domain of human YTHDF2 in complex with an m6A mononucleotide reveals an aromatic cage for m6A recognition. Cell Res. 24, 1490–1492 [PMC free article] [PubMed]
27. Patel D. J., Wang Z. (2013) Readout of epigenetic modifications. Annu. Rev. Biochem. 82, 81–118 [PMC free article] [PubMed]
28. Kabsch W. (2010) XDS. XDS. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 [PMC free article] [PubMed]
29. Evans P. R., Murshudov G. N. (2013) How good are my data and what is the resolution? Acta Crystallogr. D Biol. Crystallogr. 69, 1204–1214 [PMC free article] [PubMed]
30. Winn M. D. (2003) An overview of the CCP4 project in protein crystallography: an example of a collaborative project. J. Synchrotron Radiat. 10, 23–25 [PubMed]
31. McCoy A. J., Grosse-Kunstleve R. W., Adams P. D., Winn M. D., Storoni L. C., Read R. J. (2007) Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 [PubMed]
32. Murshudov G. N., Skubák P., Lebedev A. A., Pannu N. S., Steiner R. A., Nicholls R. A., Winn M. D., Long F., Vagin A. A. (2011) REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D Biol. Crystallogr. 67, 355–367 [PMC free article] [PubMed]
33. Adams P. D., Afonine P. V., Bunkóczi G., Chen V. B., Davis I. W., Echols N., Headd J. J., Hung L. W., Kapral G. J., Grosse-Kunstleve R. W., McCoy A. J., Moriarty N. W., Oeffner R., Read R. J., Richardson D. C., Richardson J. S., Terwilliger T. C., Zwart P. H. (2010) PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 [PMC free article] [PubMed]
34. Chen V. B., Arendall W. B. 3rd, Headd J. J., Keedy D. A., Immormino R. M., Kapral G. J., Murray L. W., Richardson J. S., Richardson D. C. (2010) MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr 66, 12–21 [PMC free article] [PubMed]
35. Emsley P., Lohkamp B., Scott W. G., Cowtan K. (2010) Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 66, 486–501 [PMC free article] [PubMed]
36. Yang H., Guranovic V., Dutta S., Feng Z., Berman H. M., Westbrook J. D. (2004) Automated and accurate deposition of structures solved by x-ray diffraction to the Protein Data Bank. Acta Crystallogr. D Biol. Crystallogr. 60, 1833–1839 [PubMed]
37. Gildea R. J., Bourhis L. J., Dolomanov O. V., Grosse-Kunstleve R. W., Puschmann H., Adams P. D., Howard J. A. (2011) iotbx.cif: a comprehensive CIF toolbox. J. Appl. Crystallogr. 44, 1259–1263 [PMC free article] [PubMed]
38. Cowtan K. (2010) Recent developments in classical density modification. Acta Crystallogr. D Biol. Crystallogr. 66, 470–478 [PMC free article] [PubMed]
39. Cowtan K. (2006) The Buccaneer software for automated model building: 1. tracing protein chains. Acta Crystallogr. D Biol. Crystallogr. 62, 1002–1011 [PubMed]
40. Lebedev A. A., Young P., Isupov M. N., Moroz O. V., Vagin A. A., Murshudov G. N. (2012) JLigand: a graphical tool for the CCP4 template-restraint library. Acta Crystallogr. D Biol. Crystallogr. 68, 431–440 [PMC free article] [PubMed]
41. Perrakis A., Sixma T. K., Wilson K. S., Lamzin V. S. (1997) wARP: improvement and extension of crystallographic phases by weighted averaging of multiple-refined dummy atomic models. Acta Crystallogr. D Biol. Crystallogr. 53, 448–455 [PubMed]
42. Langer G., Cohen S. X., Lamzin V. S., Perrakis A. (2008) Automated macromolecular model building for X-ray crystallography using ARP/wARP version 7. Nat. Protoc. 3, 1171–1179 [PMC free article] [PubMed]
43. Wei C. M., Moss B. (1977) Nucleotide sequences at the N6-methyladenosine sites of HeLa cell messenger ribonucleic acid. Biochemistry 16, 1672–1676 [PubMed]
44. Krissinel E., Henrick K. (2004) Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D Biol. Crystallogr. 60, 2256–2268 [PubMed]
45. Kampf G., Kapinos L. E., Griesser R., Lippert B., Sigel H. (2002) Comparison of the acid-base properties of purine derivatives in aqueous solution. Determination of intrinsic proton affinities of various basic sites. J. Chem. Soc. Perkin Trans. 2, 1320–1327
46. Ye Y., Godzik A. (2004) FATCAT: a web server for flexible structure comparison and structure similarity searching. Nucleic Acids Res. 32, W582–W585 [PMC free article] [PubMed]
47. Bertonati C., Punta M., Fischer M., Yachdav G., Forouhar F., Zhou W., Kuzin A. P., Seetharaman J., Abashidze M., Ramelot T. A., Kennedy M. A., Cort J. R., Belachew A., Hunt J. F., Tong L., Montelione G. T., Rost B. (2009) Structural genomics reveals EVE as a new ASCH/PUA-related domain. Proteins 75, 760–773 [PMC free article] [PubMed]
48. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., Shindyalov I. N., Bourne P. E. (2000) The Protein Data Bank. Nucleic Acids Res. 28, 235–242 [PMC free article] [PubMed]
49. Pan T. (2013) N6-Methyl-adenosine modification in messenger and long non-coding RNA. Trends Biochem. Sci. 38, 204–209 [PMC free article] [PubMed]
50. Fu Y., Dominissini D., Rechavi G., He C. (2014) Gene expression regulation mediated through reversible m6A RNA methylation. Nat. Rev. Genet. 15, 293–306 [PubMed]

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology