Nine nucleotides in the U3 of HIV-1, which are complementary to nucleotides (38-46) in the primer tRNA
3Lys, were found to stimulate minus strand DNA transfer
in vitro by holding together the 5′ and 3′ regions of the viral RNA genome
8,9. The nine-nucleotide motif in U3 might have been created by mutations generated during reverse transcription. It would then have been maintained though evolutionary pressure because it can promote viral replication. However, our computational alignment of tRNA
3Lys against the HIV-1 genome revealed that the nine-nucleotide segment is, in fact, a part of the larger sequence with additional complementarities to the tRNA
3Lys molecule. The complementary sequence spans segments of both the U3 and R elements. Since it is a full-length tRNA sequence, it is more likely to be an acquired cellular gene, than a sequence that evolved within the virus.
Our mutational analysis using a reconstituted system for minus strand transfer
in vitro showed that extended complementarities in the U3 region, beyond the nine-nucleotide segment, promote greater transfer efficiency. The actual mechanism by which complementary sequences in U3 that are 5′ of the nine-nucleotide motif promote transfer could be complex. They would certainly act by augmenting the proximity mechanism promoted by the nine-nucleotide segment. These other sequences also resemble the PBS sequence, and so could help to displace the primer tRNA from the PBS. The release of tRNA
3Lys from the PBS is necessary to allow copying of the template region of the PBS into DNA, and copying of the 3′ end of the tRNA into DNA, both in preparation for second strand DNA transfer
1.
A 29-nucleotide long segment near the middle of the tRNA-like sequence has no complementarity to tRNA
3Lys, but has the appropriate length and position expected for a tRNA intron. This suggests that the whole region originated from a tRNA gene incorporated into the LTR in an early stage of HIV-1 evolution. The preservation of the sequence in the LTR might partly derive from its role as a section of the HIV-1 promoter. The tRNA gene-like sequence includes the TATAA box and sites recognized by several transcription factors
27, probably created by mutations generated during reverse transcription and maintained by evolutionary pressure. It also spans the ascending stem of the TAR hairpin recognized by TAT protein, a viral activator promoting efficient transcriptional elongation
28. Mutations within these sites substantially reduce viral expression
29 and virus propagation
30.
Because phylogenetic analyses suggest that HIV and immunodeficiency viruses in other species adapted to and developed in new hosts after cross-species infections, we suspect that this tRNA gene-like sequence did not originate from human DNA. The virus infecting chimpanzees (SIVcpz) is indicated as the most recent ancestor of HIV-1
20. A cross species origin of HIV-1 is suggested by the presence of an apparent intron in the tRNA-like sequence, since human tRNA
3Lys lacks an intron
31. Introns are also absent in chimpanzee and monkey, but abundant with diverse sequences in cat genomes
31. However, tRNAs share some homology within the D- and T- arms, thus it is possible that a different tRNA gene was incorporated and during retrovirus evolution some of it evolved to be complementary to tRNA
3Lys. HIV-related viruses were identified in sheep, goats, horse, cattle and cats, but only the cat virus FIV appears to be a close relative of HIV and SIV
24,32. Our sequence alignment within the LTR regions revealed a high homology of HIV-1 with SIVcpz within the tRNA gene-like sequence, but not with FIV. Because FIV is thought to be a very old virus, the tRNA-like sequence in U3/R might have once been present in FIV but lost during divergent evolution.
Determination of the origin of the tRNA-like sequence should provide valuable clues about the ancestry of HIV. We known that vertebrate retroviruses can acquire new genes from their hosts during development
33. However, it is also possible that a tRNA gene was incorporated much earlier; conceivably when the viral ancestor was a mobile segment of the host genome that evolved into an infectious virus. Evolutionary studies suggest that all retroviruses evolved from retrotransposons, which are ubiquitous mobile pieces of DNA replicating through RNA intermediates
34. Phylogenetic analyses of the RNase H domains of RTs revealed that the group of LTR retrotransposons could have originated from the non-LTR group
35, and later become the ancestors of the retroviruses
36,37.
There are striking similarities between vertebrate retroviruses and LTR retrotransposons in both replication and integration mechanisms, and their “genome” organization. For some LTR retrotransposons, e.g. Ty3 in yeast, an intracellular virus-like particle had been observed
38. Interestingly,
gypsy-like elements in
Drosophila previously classified as LTR retrotransposons, are now considered to be endogenous retroviruses
39. The Ty3/
gypsy-like LTR retrotransposons are indicated as a group from which vertebrate retroviruses evolved, or a group that evolved from the same ancestor as
Retroviridae37,40. Interestingly, Ty3 LTR retrotransposons in yeast reveals unique integration preferences, namely in the vicinity of genes transcribed by RNA polymerase III, notably tRNA genes
41. Similar integration specificity was found among non-LTR retrotransposons TRE in social amoeba
42. Conceivably, the tRNA gene-like sequences in HIV and SIV are relics of a tRNA gene incorporated by an ancient retrotransposon. Notably, tRNA-like sequences are commonly found in non-LTR retrotransposons, like SINEs and LINEs
43-45. In addition, Bak and Jorgensen showed that a number of retroviruses and LTR retrotransposons have conserved sequences of the D- and T- arms of tRNA within the LTR
46. Moreover, a recent computer analysis revealed the presence of tRNA-like sequences in various transposable elements and viruses in different taxonomic divisions
47. Incorporation of tRNA genes into LTR of retrotransposons and retroviruses might have been common throughout history.
The significance of tRNA-like sequences found in some LTR retrotransposons and viruses is not clear. In HIV-1, the U3/R sequence is complementary to the tRNA
3Lys used by the virus as a primer to start reverse transcription. The tRNA
3Lys might take advantage of these complementarities and form interactions that bring both ends of the RNA viral genome into proximity to promote minus strand DNA transfer
8,9. Other roles for the tRNA-like sequence in HIV-1 could derive from the functions of the D- and T- arms of the tRNA gene representing the internal RNA polymerase III control elements called the A- and B- boxes, respectively
48. The presence of at least one of the tRNA boxes in proximity to RNA polymerase II promoters stimulates genes expression
49. The boxes are binding sites for transcription factor TFIIIC, which is also required for targeting of tRNA genes during integration of Ty3 and TRE retrotransposons
50,51. Interestingly, it was demonstrated that HIV-1 integration might occur in proximity to Alu and L1Hs elements
52,53, where conserved tRNA boxes were also identified
43,45. Moreover, the tRNA boxes and TFIIIC might help in preventing silencing of gene expression, since they are known to restrain heterochromatin spreading in genomic DNA
54. These considerations suggest that a tRNA-like sequence or A- or B- boxes in the LTR is important for replication and/or mobility of retrotransposons and possibly some viruses today. The lack of a tRNA gene-like sequence in the LTRs of HIV2, FIV and many SIV species indicates that the sequence is not absolutely required for minus strand DNA transfer to occur, but the presence of complementary sequences to primer tRNA at the 3′ ends of other retroviruses might significantly stimulate this reaction.
Overall, we suggest that the tRNA-like sequence in U3/R of HIV-1 is a remnant of an event in which a mobile ancestor of HIV acquired a cellular tRNA sequence. We believe that this sequence has been retained in recognizable form because it supports efficient minus strand transfer and transcription of the viral genome. Further investigation of this sequence offers opportunities to probe the ancestry of HIV and the roles of RNA structure in strand transfer.