Recent developments in cosmology radically change the conception of the universe as well as the very notions of "probable" and "possible". The model of eternal inflation implies that all macroscopic histories permitted by laws of physics are repeated an infinite number of times in the infinite multiverse. In contrast to the traditional cosmological models of a single, finite universe, this worldview provides for the origin of an infinite number of complex systems by chance, even as the probability of complexity emerging in any given region of the multiverse is extremely low. This change in perspective has profound implications for the history of any phenomenon, and life on earth cannot be an exception.
Origin of life is a chicken and egg problem: for biological evolution that is governed, primarily, by natural selection, to take off, efficient systems for replication and translation are required, but even barebones cores of these systems appear to be products of extensive selection. The currently favored (partial) solution is an RNA world without proteins in which replication is catalyzed by ribozymes and which serves as the cradle for the translation system. However, the RNA world faces its own hard problems as ribozyme-catalyzed RNA replication remains a hypothesis and the selective pressures behind the origin of translation remain mysterious. Eternal inflation offers a viable alternative that is untenable in a finite universe, i.e., that a coupled system of translation and replication emerged by chance, and became the breakthrough stage from which biological evolution, centered around Darwinian selection, took off. A corollary of this hypothesis is that an RNA world, as a diverse population of replicating RNA molecules, might have never existed. In this model, the stage for Darwinian selection is set by anthropic selection of complex systems that rarely but inevitably emerge by chance in the infinite universe (multiverse).
The plausibility of different models for the origin of life on earth directly depends on the adopted cosmological scenario. In an infinite universe (multiverse), emergence of highly complex systems by chance is inevitable. Therefore, under this cosmology, an entity as complex as a coupled translation-replication system should be considered a viable breakthrough stage for the onset of biological evolution.
This article was reviewed by Eric Bapteste, David Krakauer, Sergei Maslov, and Itai Yanai.
The RNA world hypothesis on the origin of life is generally considered as the key to solve the “chicken and egg dilemma” concerning the evolution of genes and proteins as observed in the modern organisms. This hypothesis, however, contains several serious weak points. We have a counterproposal called [GADV]-protein world hypothesis, abbreviated as GADV hypothesis, in which we have suggested that life originated from a [GADV]-protein world, which comprised proteins composed of four amino acids: Gly [G], Ala [A], Asp [D], and Val [V]. A new concept “pseudo-replication” is crucial for the description of the emergence of life. The new hypothesis not only plausibly explains how life originated from the initial chaotic protein world, but also how genes, genetic code, and proteins co-evolved.
GADV hypothesis; pseudo-replication; [GADV]-protein world; origin of life
The RNA world hypothesis, that RNA genomes and catalysts preceded DNA genomes and genetically-encoded protein catalysts, has been central to models for the early evolution of life on Earth. A key part of such models is continuity between the earliest stages in the evolution of life and the RNA repertoires of extant lineages. Some assessments seem consistent with a diverse RNA world, yet direct continuity between modern RNAs and an RNA world has not been demonstrated for the majority of RNA families, and, anecdotally, many RNA functions appear restricted in their distribution. Despite much discussion of the possible antiquity of RNA families, no systematic analyses of RNA family distribution have been performed. To chart the broad evolutionary history of known RNA families, we performed comparative genomic analysis of over 3 million RNA annotations spanning 1446 families from the Rfam 10 database. We report that 99% of known RNA families are restricted to a single domain of life, revealing discrete repertoires for each domain. For the 1% of RNA families/clans present in more than one domain, over half show evidence of horizontal gene transfer (HGT), and the rest show a vertical trace, indicating the presence of a complex protein synthesis machinery in the Last Universal Common Ancestor (LUCA) and consistent with the evolutionary history of the most ancient protein-coding genes. However, with limited interdomain transfer and few RNA families exhibiting demonstrable antiquity as predicted under RNA world continuity, our results indicate that the majority of modern cellular RNA repertoires have primarily evolved in a domain-specific manner.
In cells, DNA carries recipes for making proteins, and proteins perform chemical reactions, including replication of DNA. This interdependency raises questions for early evolution, since one molecule seemingly cannot exist without the other. A resolution to this problem is the RNA world, where RNA is postulated to have been both genetic material and primary catalyst. While artificially selected catalytic RNAs strengthen the chemical plausibility of an RNA world, a biological prediction is that some RNAs should date back to this period. In this study, we ask to what degree RNAs in extant organisms trace back to the common ancestor of cellular life. Using the Rfam RNA families database, we systematically screened genomes spanning the three domains of life (Archaea, Bacteria, Eukarya) for RNA genes, and examined how far back in evolution known RNA families can be traced. We find that 99% of RNA families are restricted to a single domain. Limited conservation within domains implies ongoing emergence of RNA functions during evolution. Of the remaining 1%, half show evidence of horizontal transfer (movement of genes between organisms), and half show an evolutionary history consistent with an RNA world. The oldest RNAs are primarily associated with protein synthesis and export.
Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor.
We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution.
We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.
During protein synthesis, tRNA serves as the intermediary between cognate amino acids and their corresponding RNA trinucleotide codons. Aminoacyl-tRNA is also a biosynthetic precursor and amino acid donor for other macromolecules. AA-tRNAs allow transformations of acidic amino acids into their amide-containing counterparts, and seryl-tRNASer donates serine for antibiotic synthesis. Aminoacyl-tRNA is also used to cross-link peptidoglycan, to lysinylate the lipid bilayer, and to allow proteolytic turnover via the N-end rule. These alternative functions may signal the use of RNA in early evolution as both a biological scaffold and a catalyst to achieve a wide variety of chemical transformations.
AA-tRNA; amino acid biosynthesis; Peptidoglycan synthesis; Membrane modification; Antibiotic synthesis
According to the “thermodynamic hypothesis”, the sequence of a biological macromolecule defines its folded, active structure as a global energy minimum on the folding landscape.1,2 But the enormous complexity of folding landscapes of large macromolecules raises a question: Is there indeed a unique global energy minimum corresponding to a unique native conformation, or are there deep local minima corresponding to alternative active conformations?3 Folding of many proteins is well described by two-state models, leading to highly simplified representations of protein folding landscapes with a single native conformation.4,5 Nevertheless, accumulating experimental evidence suggests a more complex topology of folding landscapes with multiple active conformations that can take seconds or longer to interconvert.6,7,8 Here we employ single molecule experiments to demonstrate that an RNA enzyme folds into multiple distinct native states that interconvert much slower than the time scale of catalysis. These data demonstrate that the severe ruggedness of RNA folding landscapes extends into conformational space occupied by native conformations.
The evolutionary puzzle of cooperation describes situations where cooperators provide a fitness benefit to other individuals at some cost to themselves. Under Darwinian selection, the evolution of cooperation is a conundrum, whereas non-cooperation (or defection) is not. In the absence of supporting mechanisms, cooperators perform poorly and decrease in abundance. Evolutionary game theory provides a powerful mathematical framework to address the problem of cooperation using the prisoner’s dilemma. One well-studied possibility to maintain cooperation is to consider structured populations, where each individual interacts only with a limited subset of the population. This enables cooperators to form clusters such that they are more likely to interact with other cooperators instead of being exploited by defectors. Here we present a detailed analysis of how a few cooperators invade and expand in a world of defectors. If the invasion succeeds, the expansion process takes place in two stages: first, cooperators and defectors quickly establish a local equilibrium and then they uniformly expand in space. The second stage provides good estimates for the global equilibrium frequencies of cooperators and defectors. Under hospitable conditions, cooperators typically form a single, ever growing cluster interspersed with specks of defectors, whereas under more hostile conditions, cooperators form isolated, compact clusters that minimize exploitation by defectors. We provide the first quantitative assessment of the way cooperators arrange in space during invasion and find that the macroscopic properties and the emerging spatial patterns reveal information about the characteristics of the underlying microscopic interactions.
evolutionary game theory; cooperation; spatial games; prisoner’s dilemma; invasion
The RNA world hypothesis posits that the earliest genetic system consisted of informational RNA molecules that directed the synthesis of modestly functional RNA molecules. Further evidence suggests that it was within this RNA-based genetic system that life developed the ability to synthesize proteins by translating genetic code. Here we investigate the early development of the translation system through an evolutionary survey of protein architectures associated with modern translation.
Our analysis reveals a structural expansion of translation proteins immediately following the RNA world and well before the establishment of the DNA genome. Subsequent functional annotation shows that representatives of the ten most ancestral protein architectures are responsible for all of the core protein functions found in modern translation.
We propose that this early robust translation system evolved by virtue of a positive feedback cycle in which the system was able to create increasingly complex proteins to further enhance its own function.
This article was reviewed by Janet Siefert, George Fox, and Antonio Lazcano (nominated by Laura Landweber)
It is now widely accepted that at an early stage in the evolution of life an RNA world arose, in which RNAs both served as the genetic material and catalyzed diverse biochemical reactions. Then, proteins have gradually replaced RNAs because of their superior catalytic properties in catalysis over time. Therefore, it is important to investigate how primitive functional proteins emerged from RNA world, which can shed light on the evolutionary pathway of life from RNA world to the modern world. In this work, we proposed that the emergence of most primitive functional proteins are assisted by the early primitive nucleotide cofactors, while only a minority are induced directly by RNAs based on the analysis of RNA-protein complexes. Furthermore, the present findings have significant implication for exploring the composition of primitive RNA, i.e., adenine base as principal building blocks.
The RNA world hypothesis states that the early evolution of life went through a stage in which RNA served both as genome and as catalyst. The central catalyst in an RNA world organism would have been a ribozyme that catalyzed RNA polymerization to facilitate self-replication. An RNA polymerase ribozyme was developed previously in the lab but it is not efficient enough for self-replication. The factor that limits its polymerization efficiency is its weak sequence-independent binding of the primer/template substrate. Here we tested whether RNA polymerization could be improved by a cationic arginine cofactor, to improve the interaction with the substrate. In an RNA world, amino acid-nucleic acid conjugates could have facilitated the emergence of the translation apparatus and the transition to an RNP world. We chose the amino acid arginine for our study because this is the amino acid most adept to interact with RNA. An arginine cofactor was positioned at ten different sites on the ribozyme, using conjugates of arginine with short DNA or RNA oligonucleotides. However, polymerization efficiency was not increased in any of the ten positions. In five of the ten positions the arginine reduced or modulated polymerization efficiency, which gives insight into the substrate-binding site on the ribozyme. These results suggest that the existing polymerase ribozyme is not well suited to using an arginine cofactor.
Top-down causation has been implicit in many sociological accounts of social structure and its influence on social events, but the social sciences have struggled to provide a coherent account of top-down causation itself. This paper summarizes a critical realist view of causation and emergence, shows how it supports a plausible account of top-down causation and then applies this account to the social world. The argument is illustrated by an examination of the concept of a norm circle, a kind of social entity that, it is argued, is causally responsible for the influence of normative social institutions. Nevertheless, social entities are structured rather differently from ordinary material ones, with the result that the compositional level structure of reality implicit in the concept of top-down causation has some limitations in the social world. The paper closes by considering what might be involved in examining how top-down causation can be shown to be at work in the social domain.
downward causation; emergence; norm circles; social ontology; social structure
Ribonuclease P is an ancient endonuclease that cleaves precursor tRNA and generally consists of a catalytic RNA subunit (RPR) and one or more proteins (RPPs). It represents an important macromolecular complex and model system that is universally distributed in life. Its putative origins have inspired fundamental hypotheses, including the proposal of an ancient RNA world.
To study the evolution of this complex, we constructed rooted phylogenetic trees of RPR molecules and substructures and estimated RPP age using a cladistic method that embeds structure directly into phylogenetic analysis. The general approach was used previously to study the evolution of tRNA, SINE RNA and 5S rRNA, the origins of metabolism, and the evolution and complexity of the protein world, and revealed here remarkable evolutionary patterns. Trees of molecules uncovered the tripartite nature of life and the early origin of archaeal RPRs. Trees of substructures showed molecules originated in stem P12 and were accessorized with a catalytic P1-P4 core structure before the first substructure was lost in Archaea. This core currently interacts with RPPs and ancient segments of the tRNA molecule. Finally, a census of protein domain structure in hundreds of genomes established RPPs appeared after the rise of metabolic enzymes at the onset of the protein world.
The study provides a detailed account of the history and early diversification of a fundamental ribonucleoprotein and offers further evidence in support of the existence of a tripartite organismal world that originated by the segregation of archaeal lineages from an ancient community of primordial organisms.
A variety of macromolecules and small molecules—(oligo)nucleotides, proteins, lipids and metabolites—are collectively considered essential to early life. However, previous schemes for the origin of life—e.g. the ‘RNA world’ hypothesis—have tended to assume the initial emergence of life based on one such molecular class followed by the sequential addition of the others, rather than the emergence of life based on a mixture of all the classes of molecules. This view is in part due to the perceived implausibility of multi-component reaction chemistry producing such a mixture. The concept of systems chemistry challenges such preconceptions by suggesting the possibility of molecular synergism in complex mixtures. If a systems chemistry method to make mixtures of all the classes of molecules considered essential for early life were to be discovered, the significant conceptual difficulties associated with pure RNA, protein, lipid or metabolism ‘worlds’ would be alleviated. Knowledge of the geochemical conditions conducive to the chemical origins of life is crucial, but cannot be inferred from a planetary sciences approach alone. Instead, insights from the organic reactivity of analytically accessible chemical subsystems can inform the search for the relevant geochemical conditions. If the common set of conditions under which these subsystems work productively, and compatibly, matches plausible geochemistry, an origins of life scenario can be inferred. Using chemical clues from multiple subsystems in this way is akin to triangulation, and constitutes a novel approach to discover the circumstances surrounding the transition from chemistry to biology. Here, we exemplify this strategy by finding common conditions under which chemical subsystems generate nucleotides and lipids in a compatible and potentially synergistic way. The conditions hint at a post-meteoritic impact origin of life scenario.
prebiotic chemistry; systems chemistry; chemoselectivity; phosphorylation
It is still not clear how prebiotic replicators evolved towards the complexity found in present day organisms. Within the most realistic scenario for prebiotic evolution, known as the RNA world hypothesis, such complexity has arisen from replicators consisting solely of RNA. Within contemporary life, remarkably many RNAs are involved in modifying other RNAs. In hindsight, such RNA-RNA modification might have helped in alleviating the limits of complexity posed by the information threshold for RNA-only replicators. Here we study the possible role of such self-modification in early evolution, by modeling the evolution of protocells as evolving replicators, which have the opportunity to incorporate these mechanisms as a molecular tool. Evolution is studied towards a set of 25 arbitrary ‘functional’ structures, while avoiding all other (misfolded) structures, which are considered to be toxic and increase the death-rate of a protocell. The modeled protocells contain a genotype of different RNA-sequences while their phenotype is the ensemble of secondary structures they can potentially produce from these RNA-sequences. One of the secondary structures explicitly codes for a simple sequence-modification tool. This ‘RNA-adapter’ can block certain positions on other RNA-sequences through antisense base-pairing. The altered sequence can produce an alternative secondary structure, which may or may not be functional. We show that the modifying potential of interacting RNA-sequences enables these protocells to evolve high fitness under high mutation rates. Moreover, our model shows that because of toxicity of misfolded molecules, redundant coding impedes the evolution of self-modification machinery, in effect restraining the evolvability of coding structures. Hence, high mutation rates can actually promote the evolution of complex coding structures by reducing redundant coding. Protocells can successfully use RNA-adapters to modify their genotype-phenotype mapping in order to enhance the coding capacity of their genome and fit more information on smaller sized genomes.
Understanding the origin of protein synthesis has been notoriously difficult. We have taken as a starting premise Wolf and Koonin's view that "evolution of the translation system is envisaged to occur in a compartmentalized ensemble of replicating, co-selected RNA segments, i.e., in an RNA world containing ribozymes with versatile activities".
Presentation of the hypothesis
We propose that coded protein synthesis arose from a noncoded process in an RNA world as a natural consequence of the accumulation of a range of early tRNAs and their serendipitous RNA binding partners. We propose that, initially, RNA molecules with 3' CCA termini that could be aminoacylated by ribozymes, together with an ancestral peptidyl transferase ribozyme, produced small peptides with random or repetitive sequences. Our concept is that the first tRNA arose in this context from the ligation of two RNA hairpins and could be similarly aminoacylated at its 3' end to become a substrate for peptidyl transfer catalyzed by the ancestral ribozyme. Within this RNA world we hypothesize that proto-mRNAs appeared first simply as serendipitous binding partners, forming complementary base pair interactions with the anticodon loops of tRNA pairs. Initially this may have enhanced stability of the paired tRNA molecules so they were held together in close proximity, better positioning the 3' CCA termini for peptidyl transfer and enhancing the rate of peptide synthesis. If there were a selective advantage for the ensemble through the peptide products synthesized, it would provide a natural pathway for the evolution of a coding system with the expansion of a cohort of different tRNAs and their binding partners. The whole process could have occurred quite unremarkably for such a profound acquisition.
Testing the hypothesis
It should be possible to test the different parts of our model using the isolated contemporary 50S ribosomal subunit initially, and then with RNAs transcribed in vitro together with a minimal set of ribosomal proteins that are required today to support protein synthesis.
Implications of the hypothesis
This model proposes that genetic coding arose de novo from complementary base pair interactions between tRNAs and single-stranded RNAs present in the immediate environment.
This article was reviewed by Eugene Koonin, Rob Knight and Berthold Kastner (nominated by Laura Landweber).
Aptamers and the SELEX process were discovered over two decades ago. These discoveries have spawned a productive academic and commercial industry. The collective results provide insights into biology, past and present, through an in vitro evolutionary exploration of the nature of nucleic acids and their potential roles in ancient life. Aptamers have helped usher in an RNA renaissance. Here we explore some of the evolution of the aptamer field and the insights it has provided for conceptualizing an RNA world, from its nascence to our current endeavor employing aptamers in human proteomics to discover biomarkers of health and disease.
In vitro techniques for producing nucleotide sequences that bind to particular proteins allow us to explore the role of RNA in ancient life forms, as well as generate new disease biomarkers and therapeutic agents.
Structural elucidation and analysis of fructifications of plants is fundamental for understanding their evolution. In case of Ginkgo biloba, attention was drawn by Fujii in 1896 to aberrant fructifications of Ginkgo biloba whose seeds are attached to leaves, called O-ha-tsuki in Japan. This well-known phenomenon was now interpreted by Fujii as being homologous to ancestral sporophylls. The common fructification of Ginkgo biloba consists of 1-2 (rarely more) ovules on a dichotomously divided stalk, the ovules on top of short stalklets, with collars supporting the ovules. There is essentially no disagreement that either the whole stalk with its stalklets, collars and ovules is homologous to a sporophyll, or, alternatively, just one stalklet, collar and ovule each correspond to a sporophyll. For the transition of an ancestral sporophyll resembling extant O-ha-tsuki aberrant leaves into the common fructification with stalklet/collar/ovule, evolutionary reduction of the leaf lamina of such ancestral sporophylls has to be assumed. Furthermore, such ancestral sporophylls would be expected in the fossil record of ginkgophytes.
From the Upper Permian of the Bletterbach gorge (Dolomites, South Tyrol, Italy) ginkgophyte leaves of the genus Sphenobaiera were discovered. Among several specimens, one shows putatively attached seeds, while other specimens, depending on their state of preservation, show seeds in positions strongly suggesting such attachment. Morphology and results of a cuticular analysis are in agreement with an affiliation of the fossil to the ginkgophytes and the cuticle of the seed is comparable to that of Triassic and Jurassic ones and to those of extant Ginkgo biloba. The Sphenobaiera leaves with putatively attached seeds closely resemble seed-bearing O-ha-tsuki leaves of extant Ginkgo biloba. This leads to the hypothesis that, at least for some groups of ginkgophytes represented by extant Ginkgo biloba, such sporophylls represent the ancestral state of fructifications.
Some evidence is provided for the existence of ancestral laminar ginkgophyte sporophylls. Homology of the newly found fossil ginkgophyte fructifications with the aberrant O-ha-tsuki fructifications of Ginkgo biloba is proposed. This would support the interpretation of the apical part of the common Ginkgo biloba fructification (stalklet/collar/ovule) as a sporophyll with reduced leaf lamina.
All eukaryotes require mitochondria for survival and growth. The origin of mitochondria can be traced down to a single endosymbiotic event between two probably prokaryotic organisms. Subsequent evolution has left mitochondria a collection of heterogeneous organelle variants. Most of these variants have retained their own genome and translation system. In hydrogenosomes and mitosomes, however, the entire genome was lost. All types of mitochondria import most of their proteome from the cytosol, irrespective of whether they have a genome or not. Moreover, in most eukaryotes, a variable number of tRNAs that are required for mitochondrial translation are also imported. Thus, import of macromolecules, both proteins and tRNA, is essential for mitochondrial biogenesis. Here, we review what is known about the evolutionary history of the two processes using a recently revised eukaryotic phylogeny as a framework. We discuss how the processes of protein import and tRNA import relate to each other in an evolutionary context.
mitochondria; protein import; tRNA import; mitosomes; hydrogenosomes
Small nucleolar (sno)RNAs are required for posttranscriptional processing and modification of ribosomal, spliceosomal and messenger RNAs. Their presence in both eukaryotes and archaea indicates that snoRNAs are evolutionarily ancient. The location of some snoRNAs within the introns of ribosomal protein genes has been suggested to belie an RNA world origin, with the exons of the earliest protein-coding genes having evolved around snoRNAs after the advent of templated protein synthesis. Alternatively, this intronic location may reflect more recent selection for coexpression of snoRNAs and ribosomal components, ensuring rRNA modification by snoRNAs during ribosome synthesis. To gain insight into the evolutionary origins of this genetic organization, we examined the antiquity of snoRNA families and the stability of their genomic location across 44 eukaryote genomes.
We report that dozens of snoRNA families are traceable to the Last Eukaryotic Common Ancestor (LECA), but find only weak similarities between the oldest eukaryotic snoRNAs and archaeal snoRNA-like genes. Moreover, many of these LECA snoRNAs are located within the introns of host genes independently traceable to the LECA. Comparative genomic analyses reveal the intronic location of LECA snoRNAs is not ancestral however, suggesting the pattern we observe is the result of ongoing intragenomic mobility. Analysis of human transcriptome data indicates that the primary requirement for hosting intronic snoRNAs is a broad expression profile. Consistent with ongoing mobility across broadly-expressed genes, we report a case of recent migration of a non-LECA snoRNA from the intron of a ubiquitously expressed non-LECA host gene into the introns of two LECA genes during the evolution of primates.
Our analyses show that snoRNAs were a well-established family of RNAs at the time when eukaryotes began to diversify. While many are intronic, this association is not evolutionarily stable across the eukaryote tree; ongoing intragenomic mobility has erased signal of their ancestral gene organization, and neither introns-first nor evolved co-expression adequately explain our results. We therefore present a third model — constrained drift — whereby individual snoRNAs are intragenomically mobile and may occupy any genomic location from which expression satisfies phenotype.
snoRNA; Last Eukaryotic Common Ancestor; Intron; Retrotransposition; Introns-first; Constrained drift
It is generally believed that life first evolved from single-stranded RNA (ssRNA) that both stored genetic information and catalyzed the reactions required for self-replication.
Presentation of the hypothesis
By modeling early genome evolution on the engineering paradigm design-by-contract, an alternative scenario is presented in which life started with the appearance of double-stranded RNA (dsRNA) as an informational storage molecule while catalytic single-stranded RNA was derived from this dsRNA template later in evolution.
Testing the hypothesis
It was investigated whether this scenario could be implemented mechanistically by starting with abiotic processes. Double-stranded RNA could be formed abiotically by hybridization of oligoribonucleotides that are subsequently non-enzymatically ligated into a double-stranded chain. Thermal cycling driven by the diurnal temperature cycles could then replicate this dsRNA when strands of dsRNA separate and later rehybridize and ligate to reform dsRNA. A temperature-dependent partial replication of specific regions of dsRNA could produce the first template-based generation of catalytic ssRNA, similar to the developmental gene transcription process. Replacement of these abiotic processes by enzymatic processes would guarantee functional continuity. Further transition from a dsRNA to a dsDNA world could be based on minor mutations in template and substrate recognition sites of an RNA polymerase and would leave all existing processes intact.
Implications of the hypothesis
Modeling evolution on a design pattern, the 'dsRNA first' hypothesis can provide an alternative mechanistic evolutionary scenario for the origin of our genome that preserves functional continuity.
This article was reviewed by Anthony Poole, Eugene Koonin and Eugene Shakhnovich
Accurate reconstruction of phylogenies remains a key challenge in evolutionary biology. Most biologically plausible formulations of the problem are formally NP-hard, with no known efficient solution. The standard in practice are fast heuristic methods that are empirically known to work very well in general, but can yield results arbitrarily far from optimal. Practical exact methods, which yield exponential worst-case running times but generally much better times in practice, provide an important alternative. We report progress in this direction by introducing a provably optimal method for the weighted multi-state maximum parsimony phylogeny problem. The method is based on generalizing the notion of the Buneman graph, a construction key to efficient exact methods for binary sequences, so as to apply to sequences with arbitrary finite numbers of states with arbitrary state transition weights. We implement an integer linear programming (ILP) method for the multi-state problem using this generalized Buneman graph and demonstrate that the resulting method is able to solve data sets that are intractable by prior exact methods in run times comparable with popular heuristics. We further show on a collection of less difficult problem instances that the ILP method leads to large reductions in average-case run times relative to leading heuristics on moderately hard problems. Our work provides the first method for provably optimal maximum parsimony phylogeny inference that is practical for multi-state data sets of more than a few characters.
combinatorial optimization; linear programming; molecular evolution; phylogenetic trees
A hypothetical evolutionary pathway from a ribozyme to a catalytic RNA–protein complex (RNP) is proposed and examined. In this hypothesis for an early phase of molecular evolution, one RNA–RNA interaction in the starting ribozyme is replaced with an RNA–protein interaction via two intermediary stages. At each stage, the original RNA–RNA interaction and a newly introduced RNA–protein interaction are designed to coexist. The catalytic RNPs corresponding to the intermediary stages were constructed by employing the Tetrahymena ribozyme together with molecular modeling. Analyses of the RNPs indicate that the protein can fully replace the original role of the RNA–RNA interaction in the starting ribozyme and that the association of a protein with a ribozyme might be beneficial for improving the ribozymatic activity.
A little-noted result of the confirmation of multiple premises of the RNA-world hypothesis is that we now know something about the dawn organisms that followed the origin of life, perhaps over 4 billion years ago. We are therefore in an improved position to reason about the biota just before RNA times, during the era of the first replicators, the first Darwinian creatures on Earth. An RNA congener still prominent in modern biology is a plausible descendent of these first replicators.
Nucleotide cofactors such as NADP could represent relics of pre-RNA life on Earth. They may have been key elements of the first replicators, subsequently co-opted by RNA and then proteins.
All living organisms encode the 20 natural amino acid units of polypeptides using a universal scheme of triplet nucleotide “codons”. Disparate features of this codon scheme are potentially informative of early molecular evolution: (i) the absence of any codons for d-amino acids; (ii) the odd combination of alternate codon patterns for some amino acids; (iii) the confinement of synonymous positions to a codon’s third nucleotide; (iv) the use of 20 specific amino acids rather than a number closer to the full coding potential of 64; and (v) the evolutionary relationship of patterns in stop codons to amino acid codons. Here I propose a model for an ancestral proto-anti-codon RNA (pacRNA) auto-aminoacylation system and show that pacRNAs would naturally manifest features of the codon table. I show that pacRNAs could implement all the steps for auto-aminoacylation: amino acid coordination, intermediate activation of the amino acid by the 5′-end of the pacRNA, and 3′-aminoacylation of the pacRNA. The anti-codon cradles of pacRNAs would have been able to recognize and coordinate only a small number of l-amino acids via hydrogen bonding. A need for proper spatial coordination would have limited the number of chargeable amino acids for all anti-codon sequences, in addition to making some anti-codon sequences unsuitable. Thus, the pacRNA model implies that the idiosyncrasies of the anti-codon table and l-amino acid homochirality co-evolved during a single evolutionary period. These results further imply that early life consisted of an aminoacylated RNA world with a richer enzymatic potential than ribonucleotides alone.
RNA world; Proto-anti-codons; Homochirality; Molecular evolution; Ribozymes
ATP occupies a central position in biology, for it is both an elementary building block of RNA and the most widely used cofactor in all living organisms. For this reason, it has been a recurrent target for in vitro molecular evolution techniques. The exploration of ATP-binding motifs constitutes both an important step in investigating the plausibility of the ‘RNA world’ hypothesis and a central starting point for the development of new enzymes. To date, only two RNA motifs that bind ATP have been characterized. The first one is targeted to the adenosine moiety, while the second one recognizes the ‘Hoogsteen’ face of the base. To isolate aptamers that bind ATP in different orientations, we selected RNAs on an affinity resin that presents ATP in three different orientations. We obtained five new motifs that were characterized and subsequently submitted to a secondary selection protocol designed to isolate aptamers specific for cordycepin. Interestingly, all the ATP-binding motifs selected specifically recognize the sugar-phosphate backbone region of the nucleotides. Three of the aptamers show some selectivity for adenine derivatives, while the remainder recognize any of the four nucleotides with similar efficiency. The characteristics of these aptamers are discussed along with implications for in vitro molecular evolution.