ADARs contain dsRBMs so it is not surprising that their substrates are highly base-paired molecules. What is surprising, however, is that the sequences targeted by ADARs are almost entirely double-stranded. and show secondary structures for representative ADAR substrates. In contrast to the familiar secondary structures of tRNA and rRNA, which consist of short double helices branching off from nonhelical sequences, secondary structures of ADAR substrates usually consist of one long, unbranched double helix. Whereas the R/G hairpin is one of the shortest helices known to be targeted by an ADAR in vivo () (
31,
53), others consist of hundreds of nearly contiguous base-pairs (
45) or, like HDV, a long helix interrupted by mismatches, bulges, and loops ().
Most of the known ADAR substrates form their structures when a polynucleotide “folds back” to create an intramolecular hairpin, as shown in . However, there are also examples of substrates that form through intermolecular interactions, such as the bFGF mRNA pairing with a naturally occurring antisense RNA (
94) and the early and late transcripts of polyoma virus (
80).
RNA structures are commonly derived through a rigorous battery of techniques, including biochemical, phylogenetic, and biophysical methods. By these criteria, very few of the structures of ADAR substrates are rigorously proven. However, in many cases compensatory mutations have been analyzed, and these studies verify the interaction of the complementary sequences, albeit not their details (as cited in 76). The structures of the long hairpins found in noncoding sequences have not been proven (
45) (). However, pairing seems almost certain in these molecules, and the presence of inosines is proof of their double-stranded character because ADARs will not target single-stranded RNA (
8,
95).
The R/G site hairpin is an exception to the above because it has been proven with mutagenesis, biochemistry, and phylogenetics (
31,
53). Interestingly, the pattern of covariation observed in the phylogenetic analysis of the R/G hairpin was quite different than expected (
31). Phylogenetic analyses of tRNA and rRNA show that, in general, the identity of bases within helical regions are less conserved than those in nonhelical regions. In contrast, for the R/G site hairpin, bases in helices are most conserved, and bases in nonhelical regions covary so as to maintain their nonhelical state (). This pattern is not so surprising because, as described below, the exact position of loops and other unpaired sequences found in ADAR substrates is critical in maintaining enzyme specificity. The conservation of sequences predicted to be in helical regions has also been noted for the structure important for editing serotonin receptor mRNA (
66).