Character attributes represent transformation pathways and hypotheses of relationship that are falsifiable and link character states to each other using basic evolutionary assumptions or axioms
[90]. Phylogenetic analysis of RNA structure rests on a very simple model of change in which geometrical or statistical features of structure (e.g. length of structural elements; values of Shannon entropy of the base pairing probability matrix) increase or decrease in value and on the auxiliary assumption (hypothesis of polarization) that there is an evolutionary tendency towards conformational order. Molecules in solution express different degrees of freedom, usually in the form of translations and rotations (e.g. internal rotations around single bonds) or dynamic motions that define different molecular conformations. In RNA, degrees of freedom are notably constrained by the formation of hydrogen bond interactions responsible for base pairs. This interplay is highly frustrated. Statistical mechanic simulations have successfully modeled the formation of secondary structure in RNA and the impact of mutation on structural change
[56],
[57]. We based our polarization hypothesis in this model. Within the range of free energies accessible at a given temperature, an RNA molecule folds into an ensemble of possible conformations (shapes). This ‘plastic repertoire’ delimits the time the RNA spends in each conformation. Molecular functions impact the fitness of an organism and are usually linked to certain conformation within the plastic repertoire, which are selected during evolutionary change. The more time a molecule spends in favored conformations the greater the molecule's impact on the organism's fitness. During selection, sequence mutants optimize folding to fewer thermally accessible conformations, most of which resemble the target and are most stable, spending more time in them. Moreover, the numbers of conformations that are accessible to the mutants also decreases and fold to nearly the target. This ‘lock-in’ process of structural canalization is autocatalytic and defines a general evolutionary trend of RNA molecules towards uniqueness, greater stability, and modularity. We here use this trend as hypothesis of character polarization by treating character states corresponding to increased structural order as being ancestral (plesiomorphic). Although this is a falsifiable hypothesis, thermodynamic, molecular mechanic, and phylogenetic considerations provide considerable theoretical and experimental evidence to support the polarization trend. These arguments have been recently summarized
[91] and some are here revisited: (a)
Thermodynamic arguments. The thermodynamic theory of evolution
[92],
[93] develops general principles that are applicable to biological systems of all hierarchies, ranging from molecular ensembles to ecosystems
[94]. According to this theory, biological systems are self-organizing and tend to increase the order and complexity of the system by dissipating the disorder to their surroundings. These thermodynamic principles generalized to account for non-equilibrium conditions have experimentally verified a molecular tendency towards order and stability driving biological change
[95]. (b)
Molecular mechanic arguments. A large body of theoretical evidence that maps the structural repertoire of evolving RNA sequences from energetic and kinetic perspectives confirms evolution enhances conformational order and diminishes conflicting molecular interactions
[56], with some important predictions supported experimentally
[58],
[96]. Studies of extant and randomized RNA sequences have also shown these tendencies. Randomizations of mono- and dinucleotides in single-stranded nucleic acids have been used to assess the effects of composition and order of nucleotides in the stability of folded molecules, uncovering evolutionary processes acting at DNA and RNA levels
[97]. In recent experiments, extant evolved RNA molecules encoding complex, functional structural folds were compared to oligonucleotides corresponding to randomized counterparts
[98]. Unlike evolved molecules, arbitrary sequences were prone to having multiple competing conformations. In contrast to arbitrary proteins, which rarely fold into well-ordered structures
[99], these arbitrary RNA sequences were however quite soluble and compact. They appeared delimited by physicochemical constraints such as nucleotide composition that were inferred in previous computational studies
[96]. (c)
Phylogenetic arguments. Tendencies towards structural order and the hypothesis for rooting of trees have been experimentally verified by phylogenetic congruence between trees generated from RNA sequence and those generated from structure
[12],
[13],
[100], in addition to congruence between phylogenies generated from geometric and statistical characters
[30],
[99],
[101]. Polarizing characters in the opposite direction resulted in trees that were less parsimonious and had topologies incompatible with conventional taxonomy. Phylogenetic analyses testing hypotheses of organismal origin derived from global trees of tRNA structures and constraint analysis
[102] and phylogenies of proteomes derived from an analysis of protein structures in entire genomic complements
[33] proved to be congruent. They provide further indirect support to our hypothesis of polarization. Interestingly, we found character state changes are considerable, for example, along the basal branches of trees of helical substructures and in several other places of the tree (
Figure S8). This suggests that the ancestral placement of basal helices (e.g., h44) does not result from helices being longer or from a ‘long branch attraction’ artifact. It also shows that stability and frustration of substructures are indeed important and congruent factors shaping the structure of rRNA. Since many of these structural components are functionally important, the increased frequency of character state change could reflect the various adaptations that are unique to organisms in different environments involved in the regulation of the translation process.