|Home | About | Journals | Submit | Contact Us | Français|
A general mass spectrometry-based screen for unusually hydrophobic cellular small-molecule RNA conjugates revealed geranylated RNA in E. coli, Enterobacter aerogenes, Pseudomonas aeruginosa, and Salmonella thyphimurium. The geranyl group is conjugated to the sulfur atom in two 5-methylaminomethyl-2-thiouridine nucleotides. These geranylated nucleotides occur in the first anticodon position of tRNAGluUUC, tRNALysUUU, tRNAGlnUUG at a frequency of up to 6.7% (~400 geranylated nucleotides per cell). RNA geranylation levels can be increased or abolished by mutation or deletion of the selU (ybbB) gene in E. coli, and purified SelU protein in the presence of geranyl pyrophosphate and tRNA can produce geranylated tRNA. The presence or absence of the geranyl group in tRNAGluUUC, tRNALysUUU, and tRNAGlnUUG affects codon bias and frameshifting during translation. These RNAs represent the first reported examples of oligoisoprenylated cellular nucleic acids.
The known roles of RNA in living systems have rapidly expanded over the past three decades. The development and application of methods including bioinformatic analysis, oligonucleotide microarray technology, and high-throughput sequencing have revealed that RNA is an active contributor to many biological processes including gene regulation,1-2 catalysis,3 and viral defense.4 While the known biological repertoire of cellular RNA has grown dramatically, the observed chemical diversity of RNA has remained relatively unchanged over the past several decades. Indeed, approximately two-thirds of all known RNA modifications were reported before 1980.5 The application of modern analytical methods to either a small subset of RNAs or to the study of specific modifications of RNA have enabled the recent discovery of agmatine-modified tRNA in archaea6-7 and to the elucidation of the biosynthesis of several nucleosides.8-10 We recently developed general and highly sensitive chemical screens to reveal cellular small-molecule RNA conjugates in a broad and unbiased manner that is not limited to any particular class of modifications or any specific subset of RNAs. These screens were previously designed to capture chemically reactive11 and hydrophilic12 novel RNA nucleotides, resulting in the discovery of coenzyme A-linked RNA and NAD-linked RNA isolated from several bacteria.11-12
Large, hydrophobic small molecules conjugated to cellular RNAs could potentially influence the structure, function, or subcellular localization of RNA. Such conjugates could also be evolutionary fossils from ancient RNA-mediated lipid synthesis, as has been previously speculated13. We therefore implemented a chemical screen to isolate and identify unusually hydrophobic small molecules conjugated to biological RNA. Here we report the discovery, structural elucidation, and initial biological characterization of geranylated RNAs from bacterial cells. These findings represent the first reported examples of oligoisoprenylated nucleotides from cells.
We modified our previously reported small molecule-RNA conjugate screen12 to increase the likelihood of detecting unusually hydrophobic nucleotides. The complete experimental details are provided in the Methods and the Supplementary Information. Briefly, we isolated total cellular RNA isolated in a manner that minimizes contamination with small molecules, DNA, or protein. We digested the resulting RNA pool with nuclease P1 and analyzed the products by high-resolution LC/MS. To ensure that the identified species are covalently attached to RNA, we treated an identical sample of RNA with heat-inactivated nuclease P1 and analyzed by LC/MS in parallel (Fig. 1a, Supplementary Methods). Small molecules conjugated to RNA generate non-canonical nucleotides (or fragments thereof) in the nuclease P1-digested sample that are absent in the control sample treated with inactivated nuclease P1. To focus the screen on hydrophobic RNA modifications, we (i) performed size-exclusion chromatography after digestion in the presence of organic solvents, (ii) implemented a C8 reverse-phase column for LC/MS analysis, and (iii) only considered species eluting later than AMP-Trp, the most hydrophobic of the aminoacyl adenylates, for further evaluation. The application of this screen to RNA isolated from E. coli revealed eight unknown species that are more hydrophobic than AMP-Trp and that are enriched at least three-fold in the active nuclease P1 samples relative to the control samples (Supplementary Results, Supplementary Table 1).
We chose two of these species, with [M-H]− m/z = 824.200 and [M-H]− m/z = 868.189, for further investigation (Fig. 1b) based on their hydrophobicity, their high degree of enrichment, and their relatedness; MS/MS fragmentation of either species resulted in ions consistent with uridine monophosphate, an unmodified ribose, and an unknown group (307.171 or 351.161 Da) (Fig. 2a). We hypothesized that the unknown groups were two novel nucleobases, and that breakage of the glycosidic bond between the novel nucleobase and a ribose gives rise to the observed UMP-ribose adduct ([M-H]− m/z = 517.029). Further MS/MS/MS fragmentation of the UMP-ribose adduct demonstrates that the dinucleotides are of the sequence 5′-X-U-3′ (Supplementary Fig. 1).
To determine the molecular formula of the unknown nucleotides, we isolated RNA from E. coli grown with 13C-glucose as the sole carbon source or 15N-ammonium chloride as the sole nitrogen source. The resulting increase in mass of the parent and the daughter ions revealed the molecular formulas of the unknown dinucleotides and the corresponding unknown nucleobases: 825.2 Da dinucleotide = C30H45N5O16P2S and 307.2 Da nucleobase = C16H25N3OS; 869.2 Da dinucleotide = C31H45N5O18P2S and 351.2 Da nucleobase = C17H25N3O3S (Supplementary Fig. 2). These empirical formulas are consistent with 5′-X-U-3′ structures. Based on the high degree of hydrophobicity of the parental species, we reasoned that the unknown nucleobases likely contain a lipid-like group. The empirical formulas also suggested that the two nucleobases likely differ by the presence or absence of a carboxylic acid.
To further characterize the structures of the unknown nucleobases, we performed MS/MS experiments in positive and negative ion mode on both unlabeled and isotopically labeled samples (Fig. 2b,c, Supplementary Fig. 3). Based on the isotope labeling data, a key fragment, [M+H]+ m/z = 137.14, had a predicted neutral molecular formula of C10H16, corresponding to three degrees of unsaturation (Fig. 2d). We hypothesized this ion might arise from the elimination of a geranyl group attached to the novel nucleobases (Supplementary Fig. 4a). The presence of a characteristic ion fragment with [M+H]+ m/z = 81.07 in the MS/MS of both nucleobases which we also observed in the MS/MS spectra of geraniol, geranyl acetate, farnesol, and geranylgeraniol, four potentially related lipid-like molecules analyzed for comparison, supports this hypothesis (Supplementary Fig. 4b).
The removal of a geranyl group (C10H16) from the empirical formulae of the unknown nucleobases resulted in molecular formulas matching two known modified nucleobases 5-methylaminomethyl-2-thiouridine (mnm5s2U) and 5-carboxymethylaminomethyl-2-thiouridine (cmnm5s2U). We therefore speculated that the unknown nucleobases could be geranylated mnm5s2U and geranylated cmnm5s2U. These structures are consistent with all ion species arising from the fragmentation of the nucleobases (Supplementary Table 2). Importantly, the [M+H]+ m/z = 277.139 fragment observed in the positive mode MS/MS of both nucleobases, a fragment consistent with the presence of the geranyl group and the absence of the (carboxy)methylamino group, strongly suggests that the proposed geranyl group is not linked to the exocyclic amine of either novel nucleobase (Fig. 2d).
To confirm that the unknown nucleotides were derivatives of mnm5s2U and cmnm5s2U and to verify the regiochemistry of the geranyl group, we prepared RNA from E. coli strains lacking genes involved in the biosynthesis of mnm5s2U and cmnm5s2U. E. coli ΔmnmA or ΔmnmE lack the ability to install sulfur at the C2 of uridine or to incorporate the methylaminomethyl group at the C5 position, respectively.14-15 Neither sample contained detectable quantities of the previously observed unknown nucleotides (Supplementary Fig. 5), suggesting that the geranyl group is bound to either the sulfur atom or to the methylaminomethyl group. While RNA isolated from E. coli ΔmnmA did not contain any corresponding geranylated product (e.g., geranyl-mnm5U-U), a geranyl-2-thiouridine containing dinucleotide (ges2U-U) was present in RNA from E. coli ΔmnmE strains. These results indicate that the 2-thio position is either the position of geranylation (Fig. 3a), or is necessary for recognition by enzyme(s) responsible for installing the geranyl group.
To test the hypothesis that geranylation occurs at the 2-thio position, we developed a synthetic route to geranyl-2-thiouridine (ges2U, 1) from 2-thiouridine (Supplementary Fig. 6). Due to the challenge of discriminating the desired S-alkylation product from undesired N- and O-substituted adducts, we used 2-dimensional NMR of synthetic ges2U as well as X-ray crystallography of the nucleobase (geranyl-2-thiouracil, Supplementary Data Set 1) derived from synthetic ges2U to unambiguously confirm the structure of the synthetic ges2U (Supplementary Figs. 15-22). To generate a biological sample for comparison, we subjected RNA from E. coli ΔmnmE to P1 digestion, base hydrolysis, and dephosphorylation. The resulting biological sample and the synthetic ges2U exhibited identical retention times and eluted as a single peak upon co-injection and HPLC (Fig. 3b). In addition, side-by-side MS/MS comparison at varying collision energies16 revealed an identical ion fragmentation pattern for both molecules (Fig. 3c). Collectively, these results unambiguously indicate that these novel E. coli nucleotides contain a geranyl group that is linked to the 2-thio group, and confirm that our proposed structures for cellular mnm5ges2U (2) and cmnm5ges2U (3) are correct (Fig. 3a). We further used the synthetic standard of ges2U to estimate the amount of cellular ges2U in E. coli ΔmnmE cells by generating a standard curve with known quantities of authentic ges2U. Ion counts of ges2U in digested E. coli ΔmnmE RNA correspond to 398±81 molecules of ges2U per E. coli cell under the growth conditions used (Supplementary Fig. 7).
Next we sought to identify the RNA sequence(s) that contained the geranylated nucleotides. We fractionated total RNA by size and analyzed individual fractions by P1 digestion followed by LC/MS. Both modifications were present on RNAs of length 50 to 80 nucleotides (Supplementary Fig. 8), suggesting their possible attachment to tRNA. Considering that mnm5s2U is known to be present at the first uridine (U34) in the anticodon loop of the tRNAGluUUC and tRNALysUUU, and that tRNAGlnUUG contains a potentially related s2U derivative at the same position17-19, we speculated that the geranylated nucleotides might also be present at U34 of these tRNAs. Because these three tRNAs all contain a uridine at position 35 as well, this hypothesis is consistent with our observation of geranylated bases in 5′-X-U-3′ dinucleotides.
To test this hypothesis, we used complimentary oligonucleotides specific for tRNAGluUUC, tRNALysUUU, and tRNAGlnUUG to isolate these tRNAs from whole E. coli tRNA. In addition, we isolated tRNAAsnGUU, which does not contain any 2-thiouridine derivatives and therefore should not contain any geranylated nucleotides, as a negative control.20 P1 nuclease digestion and LC/MS analysis revealed that geranylated mnm5s2U is primarily present in tRNAGluUUC and tRNALysUUU, and that geranylated cmnm5s2U is present on tRNAGlnUUG (Fig. 4a). We observed that tRNAGlnUUG also contains mnm5ges2U to a lower extent, suggesting that this tRNA contains either mnm5s2U or cmnm5s2U at U34. As expected, we observed no geranylated nucleotides in the isolated tRNAAsnGUU.
To probe the position of the geranylated nucleotide within these tRNAs, we digested these isolated tRNAs with RNase T1, which specifically cleaves after G. We observed fragments corresponding to both (c)mnm5s2U and (c)mnm5ges2U at position U33 or U34 (Supplementary Fig. 9). Samples digested with RNase A, which cleaves specifically after pyrimidines,21 contained the dinucleotides mnm5ges2U-U and cmnm5ges2U-U. These results are consistent with geranylation of the anticodon loop since uridine is immediately 5′ of the anticodon in the tRNAs in question. MS/MS analysis of RNase T1 digestion products provided a partial sequence22 of the anticodon loop and definitively confirmed U34 as site of geranylation in tRNAGluUUC, tRNALysUUU and tRNAGlnUUG. (Fig. 4b, Supplementary Fig. 10). Collectively, these findings suggest that geranylated (c)mnm5s2U co-exists with (c)mnm5s2U at position U34 in tRNAGluUUC, tRNALysUUU, and tRNAGlnUUG. Based on previously reported abundances of tRNAs in E. coli cells23-24, we estimate that approximately 2.8-6.7% of these three tRNAs are geranylated.
To characterize the evolutionary conservation of these geranylated nucleotides, we analyzed whole cellular RNA or tRNA from eleven different organisms. Both geranylated nucleotides were present in Enterobacter aerogenes, Pseudomonas aeruginosa, and Salmonella thyphimurium but were undetectable in Vibrio fischeri, Bacillus subtilis, and several samples of eukaryotic RNA (Supplementary Table 3). These results are consistent with a previous report describing the presence of an uncharacterized C10H17 group, noted by the authors to have the same molecular formula as a geranyl group, on mnm5s2U in S. typhimurium carrying a mutation (G67E) in the sufY gene.17 SelU, the E. coli homolog of sufY, is known to catalyze the substitution of the sulfur at the C2 position of (c)mnm5s2U with selenium when selenium is present at high concentrations25. Notably, the presence or absence of homologs of sufY in each of the eleven organisms tested perfectly predicts the observed presence or absence of geranylated RNA (Supplementary Table 3), suggesting the potential role of this gene in RNA geranylation.
To test the role of selUin producing geranylated RNA, we isolated RNA from a selU-deficient E. coli strain (E. coli ΔselU), as well as E. coli ΔselU bearing a plasmid with a complementary copy of either selU(wt) or selU(G67E). In E. coli ΔselU we observed neither mnm5ges2U nor cmnm5ges2U. E. coli ΔselU complemented with selU(wt) contained similar levels of geranylated RNA as in E. coli, while complementation with selU(G67E) resulted in increased amounts of mnm5ges2U and cmnm5ges2U (Fig. 5a). Collectively, these results confirm that selU is involved in the production of geranylated RNA. We obtained similar results when complementing E. coli ΔselU with S. typhimurium sufY(G67E), S. typhimurium sufY(G67R) (a second mutation with similar phenotype as G67E17), and E. coli selU(G67R) (Supplementary Fig. 11). Since selU is part of the biosynthetic pathway of 5-methylaminomethyl-2-selenouridine (mnm5se2U) and 5-carboxymethylaminomethyl-2-selenouridine (cmnm5se2U), we analyzed RNA from E. coli grown under varying selenium concentrations.25 Interestingly, selenation occurs at the expense of geranylation at selenium concentrations above 10 nM (Supplementary Fig. 12).
To determine whether SelU directly catalyses geranylation or plays a regulatory role in the reaction, we expressed and purified E. coli SelU. Upon incubation of E. coli tRNA in the presence or absence of purified SelU and geranyl pyrophosphate, we observed a 6- to 7-fold increase of mnmges2U and cmnm5ges2U in samples containing SelU and geranyl pyrophosphate (Fig. 5b), thus confirming that SelU geranylates tRNA containing (c)mnm5s2U using geranyl pyrophosphate as co-substrate.
We hypothesized that a large hydrophobic modification at the first position of an anticodon could affect recognition of the third position in the corresponding codon during translation. tRNAGluUUC, tRNALysUUU and tRNAGlnUUG recognize the codons GAA/GAG, AAA/AAG, and CAA/CAG, respectively. Although CAG is further decoded by a separate tRNAGlnCUG, only tRNAGluUUC and tRNALysUUU decode GAA/GAG and AAA/AAG, respectively. Therefore, we compared the translation efficiency of the glutamate codons GAA and GAG at varying geranylation levels using a luciferase reporter containing a six-codon leader sequence of either (GAA)6 or (GAG)6 after the ATG start codon. Consistent with literature reports, we observed that E. coli tRNAGluUUC decodes GAA 10-fold more efficiently than GAG.26-28 While this ratio did not change significantly for E. coli ΔselU and E. coli ΔselU complemented with selU(wt), complementation with selU(G67E) strongly reduced the translation efficiency of GAA codons, altering the relative GAA:GAG translation efficiency ratio from 13:1 to 6:1 (Fig. 5c). We verified that these differences do not arise from changes in transcript levels, which did not differ significantly among all strains assayed (Supplementary Fig. 13). Collectively, these findings indicate that tRNA geranylation affects translational efficiency in a codon-dependent manner.
Next, we investigated whether tRNA geranylation levels can affect the efficiency of frameshifting during translation. In S. thyphimurium, the sufY(G67E) mutant resulting in the uncharacterized C10H17 fragment has been previously reported to enable the readthrough of a single C insertion in the his operon17. We studied the effect of varying tRNA geranylation levels on frameshift of the nucleotide sequences GCC AAGC, which promotes +1 frameshifting, and A AAA AAG, which promotes −1 frameshifting, at the lysine codon AAG.29-30 The sequences were inserted between the genes of glutathione S-transferase (GST) and maltose-binding protein (MBP). The inserted sequence was followed by a stop codon in frame with the GST-encoding sequence, while the MBP-encoding sequence was in the +1 or −1 frame, such that production of MBP requires a +1 or −1 translational frameshift. High levels of geranylation in E. coli ΔselU complemented with selU(G67E) induced the +1 frameshift (GCC AAGC) by >10-fold, and inhibited the −1 frameshift (A AAA AAG) by >10-fold relative to wild-type E. coli. For the +1 frameshift, we observed no detectable change in frameshift efficiency relative to wild-type E. coli for E. coli ΔselU or E. coli ΔselU complemented with selU, while for the −1 frameshift we observed a two-fold reduction in frameshift efficiency for E. coli ΔselU and no significant change for E. coli ΔselU complemented with selU (Figure 5d). +1 frameshifting at GCC AAGC has been reported to be induced by uncharged lysyl-tRNAs due to ribosome stalling at lysine codons in lysine-limiting conditions.30 However, the amount of lysyl-tRNA or tRNALysUUU does not change as a function of geranylation levels (Supplementary Fig. 14), suggesting that geranylation either results in ribosome stalling by affecting a different step of protein synthesis or induces frameshifting by a completely different mechanism.
Posttranscriptional modifications are ubiquitous on all tRNA molecules and are known to affect their function.31 Among the most frequently modified nucleotides in tRNAs are the anticodon wobble position (nucleotide 34) and the purine adjacent to the anticodon (nucleotide 37). These modifications ensure binding to the correct codon and are required for the efficient recognition of several codons by the same tRNA.32 Modifications at U34 are of particular importance for wobble base pairing. In bacteria, the 5-methylaminomethyl group on tRNALysUUU and tRNAGluUUC is important for efficient recognition of AAG and GAG, respectively.33-34 The 2-thio group in modified U34 of tRNAGluUUC has been reported to increase recognition of GAA33 and to be required for efficient binding to the tRNA synthetase.35 Moreover, in tRNALysUUU the 2-thio modification enables binding to the ribosome.36-37
Previously reported RNA modifications are predominantly small or hydrophilic.5 Our identification of two lipid-linked nucleotides mnm5ges2U and cmnm5ges2U represent to our knowledge the first characterized oligoisoprenylated nucleotides. These modifications are present in E. coli, E. aerogenes, P. aeroginosa, and S. thyphimurium at U34 in the anticodon of tRNAGluUUC, tRNALysUUU and tRNAGlnUUG. It is tempting to speculate that the unusual physical properties of very hydrophobic RNA modifications may have obscured their discovery using previous methods. The conditions traditionally used for thin layer chromatography and HPLC optimized for hydrophilic mononucleotides often do not allow the resolution of very hydrophobic molecules.38 Further, while mass spectrometry has previously been used to characterize novel nucleotides, the use of mass spectrometry in the primary screening method in this work enabled the detection and analysis of low-abundance species.
Our findings suggest that RNA geranylation is an alternative to selenation at low selenium concentrations. Interestingly, our results indicate that selU can directly catalyze geranylation of (c)mnm5s2U, in addition to its previously known role in selenation. That two such disparate chemical alternatives are mediated by a single enzyme (SelU in E. coli) may seem surprising, but we speculate that the active site can accommodate either selenophosphate, the substrate for 2-selenouridine synthesis,25 or geranyl pyrophosphate, the substrate for geranylation. Indeed, in Salmonella mutation of Cys 97, the homologue of the active site residue for selenation in selU, to Ala in the rhodanese domain of sufY(G67E) strongly reduces the level of the previously uncharacterized C10H17 modification.17 Even if the same residues on the protein mediate both selenation and geranylation, the two reactions are likely to proceed through distinct pathways since for selenation the sulfur at 2-thio position must be converted into a leaving group as selenium acts as nucleophile, while for geranylation the 2-thio group acts as nucleophile. Further studies on this enzyme are necessary to reveal the detailed mechanism of RNA geranylation.
High geranylation levels can affect codon bias and frameshift efficiency during translation. The exact mechanisms by which geranylation modulates codon bias and frameshifting remains to be elucidated. Frameshifting at A AAA AAG is thought to be based on the preferential recognition of AAA over AAG by the anticodon mnm5s2UUU.39 Hypomodification of tRNALysUUU at U34 reduces −1 frameshifting at NNA AAG sites, possibly due to improved wobble pairing.40 Likewise, high geranylation levels may also reduce the preference of tRNALysUUU for AAA over AAG, thereby inhibiting frameshifting. This hypothesis is supported by our observation of decreased codon bias of tRNAGluUUG upon geranylation. Considering the importance of position of U34 in codon recognition as well as tRNA-synthetase and ribosome binding,35-37 it is possible that a large hydrophobic modification at this tRNA position could have a broad impact on translation.
Recent studies in S. cerevisiae suggest that the large-scale coordination of tRNA modification is a translational response to cellular stress.41-42 Modification of RNA with unusually hydrophobic groups could also affect subcellular localization by inducing association with proteins or the lipid bilayer. Small molecule-RNA conjugates may therefore represent an additional basis for translation-independent RNA localization.43-44 Geranylation and other hydrophobic modifications of RNA may also play even broader roles in cellular function. Boxer and coworkers recently reported that lipid-linked DNA oligonucleotides enable vesicle fusion in vitro by a process resembling SNARE-mediated exocytosis.45 While lipid-oligonucleotide conjugates that induce membrane association in living systems have yet to be discovered, analogous processes enabled by lipid-linked cellular RNA may exist in nature.
For negative ion mode mass spectrometric analysis, 500 μg of isolated total RNA or commercial RNA (wheat tRNA or bovine total RNA (Sigma)) were digested with 10 U nuclease P1 (Wako Chemicals) in 500 μL of 50 mM NH4OAc, pH 4.5 at 37 °C for 40 min. The control samples were prepared by incubating 500 μg of RNA with 10 U heat-inactivated nuclease P1 (pre-incubated at 95 °C for two hours) in 500 μL of 50 mM NH4OAc, pH 4.5 at 37 °C for 40 min. The digestion products were purified by size-exclusion chromatography (NAP5, GE Healthcare) and the small-molecule fraction was collected and lyophilized. The size-exclusion chromatography step during the screen was performed in the presence of 30-50% methanol.
For positive ion mode mass spectrometric analysis, 500 μg of isolated total RNA or commercial RNA were digested with 10 U nuclease P1 and 100 U alkaline phophatase (Sigma) in 500 μL of 50 mM NH4OAc, pH 6 at 37 °C for 4 hours. The digestion products were purified and lyophilized as described above.
LC/MS was performed using a Waters Aquity UPLC Q-TOF Premier instrument. For negative ion mode analysis of the small molecule-RNA conjugate screen and characterization of nuclease P1-digested RNA, samples were dissolved in 50 μL of 0.05% NH3HCOOH in 1:1 deionized water:DMSO and LC was performed using a gradient from 0.1% aqueous ammonium formate (A1) to methanol (B1) on an Aquity UPLC BEH C8 column (1.7 μm, 2.1 mm x 50 mm, Waters) at a constant flow rate of 0.3 mL/min. The mobile phase composition was as follows: 100% A1 for 1 min; linear increase over 21 min to 100% B1; maintain at 100% B1 for 8 min; return to 100 % A1 over 5 min. The column was washed several times using short gradients from A1 to B1 between subsequent LC runs. Electrospray ionization was used with a capillary voltage of 3 kV, a sampling cone voltage of 25 V and 40 V, and a LM resolution of 4.7. The desolvation gas temperature was 300 °C, the flow rate was 800 L/hour, the source temperature was 150 °C, and the detector was operated in negative ion mode. LC/MS/MS experiments of the identified dinucleotides were performed at collision energies of 20 and 30 eV. LC/MS/MS/MS for fragmentation of the nucleobase of mnm5ges2U was performed with the cone voltage at 180 V and collision energies of 10 and 20 eV.
Positive ion mode analysis for the characterization of RNA digested with nuclease P1 and alkaline phosphatase (Sigma) was performed by dissolving the samples in 50 μL of 0.1% aqueous NH3HCOOH and performing LC using a linear gradient from 0.1% aqueous formic acid (A2) to acetonitrile (B2) on an Aquity UPLC BEH C8 column (1.7 μm, 2.1 mm x 50 mm) at a constant flow rate of 0.3 mL/min. The mobile phase composition was as follows: 100% A2 for 1 min; linear increase over 21 min to 100% B2; maintain at 100% B2 for 8 min; return to 100% A2 over 5 min. Electrospray ionization conditions were as described above with the detector operating in positive ion mode. LC/MS/MS experiments of the identified dinucleotides were performed at collision energies of 10, 20, and 30 eV. LC/MS/MS/MS analysis of the nucleobase of mnm5ges2U was performed with a cone voltage of 70 V and collision energies of 10, 20, 30, and 40 eV. LC/MS/MS/MS analysis of the nucleobase of cmnm5ges2U was performed with the cone voltage at 40 V and collision energies of 10, 20, 30, and 40 eV.
A solution of 2-thiouridine (26.0 mg, 0.1 mmol, Santa Cruz Biotechnology), geranyl bromide (57 μL, 0.3 mmol) and N,N-diisopropylethylamine (52.2 μL, 0.3 mmol) in methanol (1 mL) was stirred at 25 °C for 12 h. The resulting reaction mixture was concentrated in vacuo and directly subjected to silica gel chromatography. A white solid (38.9 mg, 98% yield) was obtained after flash chromatography: TLC Rf = 0.10 (10% MeOH/EtOAc); 1H NMR (600 MHz, CDCl3) δ 8.25 (d, J = 7.2 Hz, 1H), 5.98 (d, J = 7.2 Hz, 1H), 5.91 (d, J = 5.4 Hz, 1H), 5.28 (t, J = 7.2 Hz, 1H), 5.05 (t, J = 7.2 Hz, 1H), 4.45 (m, 1H), 4.37 (m, 1H), 4.18 (s, 1H), 3.88-3.96 (m, 2H), 3.78-3.86 (m, 2H), 2.04-2.12 (m, 2H), 1.98-2.04 (m, 2H), 1.72 (s, 3H), 1.68 (s, 3H), 1.59 (s, 3H); 13C NMR (125 MHz, CDCl3) δ 170.4, 164.9, 143.8, 141.2, 132.1, 123.9, 115.9, 109.0, 92.2, 86.1, 75.7, 70.8, 61.5, 39.9, 31.7, 26.6, 25.9, 18.0, 17.0; HRMS-EI (m/z): [M+H]+ calculated for C19H28N2O5S = 397.1792, observed = 397.1800.
This work was supported by the Howard Hughes Medical Institute and the NIH/NIGMS (R01GM065865). C.E.D. acknowledges fellowship from the Novartis Foundation. A.M.L. was supported by a NIH National Research Service Award Postdoctoral Fellowship (F32GM095028). We thank Dr. Shao-Liang Zheng for his help with X-ray diffraction and structural determination of synthetic geranyl-2-thiouracil. We thank Dr. Alan Saghatelian for providing Pseudomonas aeruginosa and Dr. Irene Chen for providing Salmonella typhimurium and BL2 facilities. We are also grateful to Jacob Carlson for his assistance, and to Professor Michael Ibba for helpful discussions.
Competing financial interest The authors declare no competing financial interests.
Author contributions C.E.D., Y.C., A.M.L., and Y.G.C. designed, and performed the experiments. All authors analyzed the results and wrote the manuscript.