The efficiency and fidelity of translation is reinforced by many mechanisms encoded in genomes. In reduced genomes, mutation rates are typically high, and selection becomes less effective in maintaining translational mechanisms. In this study, we found that bacterial endosymbiont lineages (
Buchnera) that experience relaxed selection display less optimal tRNA characteristics relative to those of their free-living relative
E. coli. Gene loss and A

+

T mutational bias in
Buchnera have lead to the loss of tRNA isoacceptors and loss of modified base pathways, the reduction of tRNA gene length, and the accumulation of base substitutions and indels (insertions/ deletions) in tRNA sequences that weaken tRNA secondary structure and possibly aminoacyl-tRNA synthetase recognition. These tRNA characteristics are conserved across four
Buchnera lineages spanning 70 million years of divergence and may result in reduced translational efficiency and fidelity relative to their ancestors. However, we did detect compensatory base substitutions in
Buchnera tRNAs, which are expected to maintain secondary structure of tRNA stem regions. Additionally, RNAseq reads reveal novel 3′ maturation processes that compensate for tRNA gene length reduction.
Divergent
Buchnera taxa in this study encode and express the same 32 tRNA genes composed of 32 different isoacceptor types (). In turn, no duplication of tRNA gene isoacceptors was found. Based on a survey of 50 eukaryotic, eubacterial, and archaeal genomes, low tRNA gene redundancy (i.e. only one or two gene copies of a particular isoacceptor) was only found in all archaeans and several bacterial genomes, and was approximately correlated with genome size (
40). In
Buchnera, because of modified wobble rules (
50,
51), all mature tRNAs expressed can theoretically base pair with the 61 possible codons (, ), which are all still encoded in
Buchnera CDS. One special
Buchnera isoacceptor that has been identified previously in
Buchnera-Ap (taxa type strain APS) is tRNA Ile
CAU (
40), where 5′-C is modified into lysidine by the enzyme TilS in
E. coli (
55), which all
Buchnera strains still encode. This special Ile
CAU isoacceptor codes for Ile instead of Met due to a wobble modification, and is ubiquitous in Eubacteria and Archaea (
40).
During genome reduction,
Buchnera has preferentially lost 5′-CNN, and to a lesser extent, 5′-GNN anticodons in family boxes and 5′ CNN anticodons from two-codon NNR families (). This pattern of tRNA isoacceptor loss is common for many bacteria with reduced genomes (), and is most likely related to gene deletion processes. Selective loss of these specific isoacceptors in family boxes and NNR two-codon families in Eubacteria was observed in previous studies (
1,
40,
68,
69) but was related to A

+

T sequence bias not deletion processes (
1,
70). We hypothesize that genome reduction, which is correlated with A

+

T bias, is the most likely explanation for this pattern of tRNA isoacceptor loss. First, the potential for wobble in codon–anticodon basepairing implies that some tRNA isoacceptors are not essential for pairing with corresponding codons (e.g. 5′-CNN, 5′-GNN anticodons) and can be eliminated through mutation and deletion. Second, due to wobble rules, 5′-GNN anticodons followed by 5′-UNN anticodons are the most promiscuous isoacceptors when pairing with cognate codons; thus, it is not surprising that 5′UNN is always retained in family box and two-box NNR codons in the most reduced genomes. In turn, 5′-UNN anticodons are probably retained because of their ability to recognize alternative codons rather than because of the high frequency of cognate codons in A

+

T rich CDS. Typically in bacteria and eukaryotes 5′-CNN and 5′-GNN anticodons of family boxes and 5′-CNN anticodons from two-codon families along with 5′ U anticodon modifications extending wobble are maintained by selection, because they increase the efficiency of translation (
1,
71). We predict that the loss of tRNA isoacceptors in
Buchnera as well as other endosymbionts potentially results in less efficient translation.
Numerous unmodified nucleotides at specific nucleotide positions on tRNA isoacceptors are conserved phylogenetically and are known to play crucial roles in defining tRNA specificity for aminoacylation (
32,
72). These conserved nucleotides are called identity elements and are required for proper recognition by the cognate aaRS in addition to playing roles as deterrents to false recognition (
32). Our results reveal that most
Buchnera tRNAs have maintained identity elements homologous to those in
E. coli, with the exceptions of Cys
GCA, Ser
GGA, Ser
GCT and Ala
GGC. In
E. coli tRNA
cys, the identity elements G15·G48 form an unusual tertiary base pair called a Levitt pair (
73). Additionally, the
E. coli identity elements A13·A22 are important in determining the structure of G15·G48 (
74). Collectively, these
E. coli identity elements are required for CysRS recognition due to their role in RNA tertiary structure (
73). In all,
Buchnera taxa, tRNA
cys G15·G48 has mutated to
U15·G48 and A13·A22 has mutated to
G13·A22. Hou
et al. (
73) found that when G15·G48 is mutated to
U15·G48, its backbone configuration is similar to the wild type tRNA
cys; however, only partial aminoacylation (46.2%) occurs relative to the wild type. How both types of changes in identity element together affect tertiary structure is unknown.
In
Buchnera tRNA Ala
GGC, the identity element G20 is mutated to
U20 in strains 5A and Ua and to
C20 in Ak and Sg. In
E. coli tRNA Ala
VGC, these same base changes were shown to result in 6× and 50× reductions in alanine charging activity, respectively, relative to native tRNA Ala
VGC (
75).
Buchnera Ala
UGC does not possess this mutation. Potentially, if this mutation is deleterious in Ala
GGC recognition, Ala
UGC can wobble to all four alternative codons for the family box codon family for alanine. Interestingly, the smallest sequenced genome of
Buchnera, for the host
Cinara cedri, retains the same tRNA isoacceptors and aaRSs as other
Buchnera taxa examined in this study; however, the Ala
GGC tRNA gene has been lost, resulting in a total of only 31 tRNA genes.
In
Buchnera tRNA Ser
GGA, the identity element G73 (the discriminator base) has mutated to
A73. Generally a mutation in the discriminator base is known to result in the loss of cognate aminoacyl-tRNA synthetase recognition; however, Shimizu
et al. (
76) demonstrated that any four bases substituted in the discriminator base of
E. coli Ser tRNA resulted in the same level of aminoacylation. Nevertheless, G73 in Ser tRNA is phylogenetically conserved (
72) and has been shown to play minor roles in SerRS discrimination (
77,
78). Additionally, in
E. coli Ser tRNA, the variable region plays a very important role as an identity element (
77,
79). In all
Buchnera taxa, except Sg, the variable region length of the Ser
GCT isoacceptor is 1

bp shorter than the
E. coli Ser
GCT isoacceptor. In summary, it is unknown how all these mutated identity elements affect
Buchnera translation, but the same mutations in
E. coli are known to significantly reduce the efficiency of aminoacylation.
In addition to requiring specificity in aminoacylation, reliable and efficient translation requires the anticodon to correctly pair with its codon. Modified nucleosides of tRNAs are essential mechanisms reinforcing translational fidelity and efficiency, especially at the wobble (N34) and 3′ position immediately adjacent to the anticodon (P37), (
1,
2,
51). Based on
E. coli tRNA homologs, we expect 16 different types of modified bases to be present in the remaining 32
Buchnera tRNAs, for both N34 and N37 positions. In
E. coli, 13 of these modified base pathways are known and
Buchnera encodes complete pathways for six of these (). All
Buchnera taxa have lost enzymes responsible for encoding N37 modified bases m
2A and m
6A, which are important in stabilizing 5′-NNC/G anticodons (
2) (). Enzymes that synthesize the N37 modification m
6t
6A are conserved in only half of
Buchnera taxa; this enzyme is known to slightly increase the efficiency of base pairing of the anticodon Thr
GGU to the codon ACC in
E. coli (
54). All N37 modified base pathways important for preventing frameshifts and stabilizing A:U and U:A at the wobble position of the anticodon and the first position of the codon were retained in all
Buchnera taxa (). These mechanisms may be essential for the fidelity of translation, especially for A

+

T rich genomes.
Modified nucleosides at the wobble base position (N34) of the anticodon are important for encoding the right amino acid, extending or restricting wobble, increasing the efficiency of base pairing and preventing frameshifts (
2,
53,
55,
56,
58,
59).
Buchnera taxa all encode the enzyme TilS that is essential for the synthesis of the modified base lysidine, and is important for encoding the amino acid Ile instead of Met (
59). All
Buchnera taxa also encode the core enzymes MmmE and MnmG that are important for synthesizing the modified bases mnm
5u, mnm
5s
2U, and cmnm
5Um, which restrict 5′U wobble in NNR two-box codons, including Arg and Leucine (). All of these pathways are complete except for MmmC, which is involved in the last step for both modified bases, mnm
5u and mnm
5s
2U. However, RNAseq mismatch evidence supports the presence of a modified base at the expected position of mnm
5s
2U (). Interestingly, the genes encoding MmmA, MmmE, MnmG, and IscS or SufS, but not MnmC are retained in several tiny endosymbiont genomes (
10). Conservation of these enzymes in reduced genomes indicates that these enzymes or derivatives are important for the production of the modified bases mnm
5u, cmnm
5Um, and especially mnm
5s
2U, which is essential for preventing frameshifts and restricting wobble in NNR two codon boxes (Glu, Lys and Gln), thereby preventing the miscoding of amino acids. For incomplete pathways producing modified bases mnm
5u and mnm
5s
2U, either a derivative may be synthesized and/or the insect host may import MnmC. For example, the pea aphid,
A. pisum expresses its
mnmC homolog (XP_003245837) in both its body and in the specialized aphid cells (bacteriocytes) that contain
Buchnera cells (
34).
Another key enzyme that is retained in
Buchnera is TadA, which is responsible for synthesizing inosine in
E. coli (
55). This wobble modification is present on Arg
ACG in many bacteria and can wobble to three alternative codons of Arg (
2,
59). Rnaseq mismatch evidence highly supports this modification, as inosine is recognized as G during the reverse transcription process (
60), and therefore we were able to measure a high frequency of modified Arg
ACG transcripts from all
Buchnera taxa. Unfortunately, other modified bases do not appear to be recognized as specific bases and in turn incorporate different frequencies of any of the four bases during reverse transcription of modified transcripts (
42,
43). Collectively, Rnaseq evidence supported the presence of five modified bases, four in which the pathways are known and present (or near present for mnm
5s
2U) and one in which the pathway is unknown (). If
Buchnera tRNAs can be isolated without host contamination, modified base presence and identity can be confirmed.
In many bacterial species, tRNA abundances are positively correlated with codon usage for highly expressed genes, thus increasing translational efficiency (
64,
65). In addition to analysing specific tRNA characteristics that influence the accuracy and efficiency of translation, we examined whether codon usage correlates with tRNA expression. We found that tRNA sense expression is highly correlated across
Buchnera taxa (), and many tRNA isoacceptors are expressed at similar levels within taxa (,
Supplementary Figure S1). A previous microarray study suggested that tRNA expression and codon usage of 50 highly expressed genes in
Buchnera-Ap were positively correlated (
37), but the relationship was weak and expression of sense and antisense tRNAs were not distinguished, possibly confounding results. Our directional RNAseq data show no relationship between tRNA expression and codon usage, for the same set of highly expressed genes in
Buchnera-Ap under similar conditions (). Furthermore, no relationship was detectable in three other
Buchnera taxa (
Supplementary Figure S1). Collectively, these results suggest that selection is not maintaining codon bias for highly expressed proteins. Interestingly, Trp
CCA, is the highest expressed isoacceptor in all
Buchnera taxa (except Ua) and has very low codon usage. In all,
Buchnera examined, isoacceptor Trp
CCA displays one of the lowest secondary structures relative to
E. coli’s homolog; potentially Trp
CCA is highly expressed to compensate for low aminoacylation efficiency related to numerous base substitutions that weaken its secondary structure [(a)].
In this study, we found that
Buchnera tRNAs have maintained high %GC relative to its CDS; however, its tRNAs are more A

+

T rich and less stable relative to homologs in
E. coli (). These results are consistent with previous findings (
28) showing that 16S rRNAs of
Buchnera and other endosymbiont species are more A

+

T rich and less stable than those of free-living relatives. Similarly, mitochondrial tRNAs from animals are more A

+

T rich and less stable than nuclear tRNAs (
80). Collectively, these results suggest that the accumulation of deleterious mutations can lead to less stable secondary structures of essential RNAs involved in translation. Some selection for stabilization is also evident as numerous compensatory base substitutions have been fixed in the stem regions of both rRNAs (
28) and tRNAs (). Alternatively,
E. coli tRNAs may possess higher %GC because its optimal growth temperature is higher than that of
Buchnera (
81), thus favoring higher %GC for increased thermal stability.
During genome reduction, 72–78% of
Buchnera tRNA genes among all taxa have deleted 3

bp, due to the loss of 3′ encoded CCA [(a)]. Nevertheless, we found that all mature
Buchnera tRNAs process 3′ CCA, and therefore they all have potential for amino acid activation [(b)]. In all
Buchnera taxa, six to eight mature tRNAs process dual or triple 3′ CCA [(b)]. These characteristics, in addition to 5′ G at the 1st and 2nd position and instability of the acceptor stem, result in tRNA degradation (
67). Interestingly, these tRNAs in
Buchnera and
E. coli transcribe 5′ G at the first and second base position and process dual or triple 3′ CCA [ (c)]. In these mature tRNAs in both
E. coli and
Buchnera, the second to last 3′ CCA is always incorporated into the 3′ acceptor stem. Potentially, the retention of encoded 5′ G at N1 and N2 and the conservation of dual and triple 3′ CCA maturation in these tRNAs [ (c)] are essential to maintain the correct secondary structure and to police unstable tRNAs via the tRNA degradation pathway.
In conclusion, our observations of altered tRNA characteristics are consistent with the hypothesis that translational fidelity is lower in
Buchnera compared with free-living relatives as represented by
E. coli. First,
Buchnera genome reduction has resulted in the loss of specific tRNA isoacceptors and modified nucleoside pathways that may reduce translational efficiency and fidelity. Second,
Buchnera’s A

+

T mutational bias and reduced selection has resulted in the reduction of tRNA stability
in vitro and specific tRNA base substitutions that may alter the efficiency of aaRS recognition. Moreover, reduced translational efficiency was supported by the lack of relationship between codon usage of highly expressed genes and cognate tRNA isoacceptor expression. Nevertheless, purifying selection appears to be strong enough in
Buchnera genomes to maintain high %GC of tRNA genes relative to CDS. Also, CCA 3′ maturation of shortened tRNA genes, and numerous compensatory base substitutions in tRNA stems help maintain tRNA secondary structure and function. Consequently, we predict that the translational efficiency and fidelity evident in
Buchnera are in an intermediate state between free-living bacteria and organelles.