Free group II introns are infectious retroelements that can bind and insert themselves into RNA and DNA molecules via reverse-splicing. Here we report the 3.4Å crystal structure of a complex between an oligonucleotide target substrate and a group IIC intron, as well as the refined free intron structure. The structure of the complex reveals the conformation of motifs involved in exon recognition by group II introns.
Host cell invasion is monitored by a series of pattern recognition receptors (PRRs) that activate the innate immune machinery upon detection of a cognate pathogen associated molecular pattern (PAMP). The RIG-I like receptor (RLR) family of PRRs includes three proteins — RIG-I, MDA5, and LGP2 — responsible for the detection of intracellular pathogenic RNA. All RLR proteins are built around an ATPase core homologous to those found in canonical Superfamily 2 (SF2) RNA helicases, which has been modified through the addition of novel accessory domains to recognize duplex RNA. This review focuses on the structural bases for pathogen-specific dsRNA binding and ATPase activation in RLRs, differential RNA recognition by RLR family members, and implications for other duplex RNA activated ATPases, such as Dicer.
Double-stranded RNAs are an important class of functional macromolecules in living systems. They are usually found as part of highly specialized intracellular machines that control diverse cellular events, ranging from virus replication, antiviral defense, RNA interference, to regulation of gene activities and genomic integrity. Within different intracellular machines, the RNA duplex is often found in association with specific RNA-dependent ATPases, including Dicer, RIG-I and DRH-3 proteins. These duplex RNA-activated ATPases represent an emerging group of motor proteins within the large and diverse super family 2 nucleic acid-dependent ATPases (which are historically defined as SF2 helicases). The duplex RNA-activated ATPases share characteristic molecular features for duplex RNA recognition, including motifs (e.g., motifs IIa and Vc) and an insertion domain (HEL2i), and they require double-strand RNA binding for their enzymatic activities. Proteins in this family undergo large conformational changes concomitant with RNA binding, ATP binding and ATP hydrolysis in order to achieve their functions, which include the release of signaling domains and the recruitment of partner proteins. The duplex RNA-activated ATPases represent a distinct and fascinating group of nanomechanical molecular motors that are essential for duplex RNA sensing and processing in diverse cellular pathways.
ATPases; antiviral immunity; Dicer; helicase; molecular motor; RIG-I like receptors; RNAi
The D135 group II intron ribozyme follows a unique folding pathway that is direct and appears to be devoid of kinetic traps. During the earliest stages of folding, D135 collapses slowly to a compact intermediate, and all subsequent assembly events are rapid. Collapse of intron Domain 1 (D1) has been shown to limit the rate constant for D135 folding, although the specific substructure of the D1 kinetic intermediate has not yet been identified. Employing time-resolved Nucleotide Analog Interference Mapping (NAIM), we have identified a cluster of atoms within the D1 main stem that control the rate constant for D135 collapse. Functional groups within the κ–ζ element are particularly important for this earliest stage of folding, which is intriguing given that this same motif also serves later as the docking site for catalytic Domain 5 (D5). Importantly, the κ–ζ element is shown to be a divalent ion binding pocket, indicating that this region is a Mg2+-dependent switch that initiates the cascade of D135 folding events. By measuring the Mg2+ dependence of the compaction rate constant, we conclude that the actual rate-limiting step in D1 compaction involves the formation of an unstable folding intermediate that is captured by the binding of Mg2+. This carefully orchestrated folding pathway, in which formation of an active-site docking region is early and rate-limiting, ensures proper folding of the intron core and faithful splicing. It may represent an important paradigm for the folding of large, multidomain RNA molecules.
ribozyme; RNA folding; catalysis; splicing; RNA structure
Recurring RNA structural motifs are important sites of tertiary interaction and as such, are integral to RNA macromolecular structure. Although numerous RNA motifs have been classified and characterized, the identification of new motifs is of great interest. In this study, we discovered four new conformationally recurring motifs: the π-turn, the Ω-turn, the α-loop and the C2′-endo mediated flipped adenosine motif. Not only do they have complex and interesting structures, but they participate in contacts of high biological significance. In a first for the RNA field, new motifs were discovered by a fully automated algorithm. This algorithm, COMPADRES, utilized a reduced representation of the RNA backbone and was highly successful at discerning unique structural relationships. This study also shows that recurring RNA substructures are not necessarily accompanied by consistent primary or secondary structure.
Dicer is a specialized nuclease that produces RNA molecules of specific lengths for use in gene silencing pathways. Dicer relies on the correct measurement of RNA target duplexes to generate products of specific lengths. It is thought that Dicer uses its multidomain architecture to calibrate RNA product length. However, this measurement model is derived from structural information from a protozoan Dicer, and does not account for the helicase domain present in higher organisms. The Caenorhabditis elegans Dicer-related helicase 3 (DRH-3) is an ortholog of the Dicer and RIG-I family of double-strand RNA activated ATPases essential for secondary siRNA production. We find that DRH-3 specifies 22 bp RNAs by dimerization of the helicase domain, a process mediated by ATPase activity and the N-terminal domain. This mechanism for RNA length discrimination by a Dicer family protein suggests an alternative model for RNA length measurement by Dicer, with implications for recognition of siRNA and miRNA targets.
It has become apparent that much of cellular metabolism is controlled by large well-folded noncoding RNA molecules. In addition to crystallographic approaches, computational methods are needed for visualizing the 3D structure of large RNAs. Here, we modeled the molecular structure of the ai5γ group IIB intron from yeast using the crystal structure of a bacterial group IIC homolog. This was accomplished by adapting strategies for homology and de novo modeling, and creating a new computational tool for RNA refinement. The resulting model was validated experimentally using a combination of structure-guided mutagenesis and RNA structure probing. The model provides major insights into the mechanism and regulation of splicing, such as the position of the branch-site before and after the second step of splicing, and the location of subdomains that control target specificity, underscoring the feasibility of modeling large functional RNA molecules.
Strategies for phasing nucleic acid structures by molecular replacement, using both experimental and de novo designed models, are discussed.
Structured RNA molecules are key players in ensuring cellular viability. It is now emerging that, like proteins, the functions of many nucleic acids are dictated by their tertiary folds. At the same time, the number of known crystal structures of nucleic acids is also increasing rapidly. In this context, molecular replacement will become an increasingly useful technique for phasing nucleic acid crystallographic data in the near future. Here, strategies to select, create and refine molecular-replacement search models for nucleic acids are discussed. Using examples taken primarily from research on group II introns, it is shown that nucleic acids are amenable to different and potentially more flexible and sophisticated molecular-replacement searches than proteins. These observations specifically aim to encourage future crystallographic studies on the newly discovered repertoire of noncoding transcripts.
nucleic acid sequence homology; de novo structure design; long noncoding RNA; RNA structure; homology modeling; RCrane
Defining the functional determinants for RNA surveillance by RIG-I
This study shows that HEL2i-domain mediated scanning allows RIG-I to sense the length of RNA targets, and a short RNA duplex that binds one RIG-I molecule can stimulate ATPase activity and interferon response.
Retinoic acid-inducible gene-I (RIG-I) is an intracellular RNA sensor that activates the innate immune machinery in response to infection by RNA viruses. Here, we report the crystal structure of distinct conformations of a RIG-I:dsRNA complex, which shows that HEL2i-mediated scanning allows RIG-I to sense the length of RNA targets. To understand the implications of HEL2i scanning for catalytic activity and signalling by RIG-I, we examined its ATPase activity when stimulated by duplex RNAs of varying lengths and 5′ composition. We identified a minimal RNA duplex that binds one RIG-I molecule, stimulates robust ATPase activity, and elicits a RIG-I-mediated interferon response in cells. Our results reveal that the minimal functional unit of the RIG-I:RNA complex is a monomer that binds at the terminus of a duplex RNA substrate. This behaviour is markedly different from the RIG-I paralog melanoma differentiation-associated gene 5 (MDA5), which forms cooperative filaments.
ATP hydrolysis; innate immunity; RNA helicase; X-ray crystallography
Group II introns are self-splicing catalytic RNAs that are thought to be ancestral to the
spliceosome. Here we report the 3.65 Å crystal structure of the group II intron from
Oceanobacillus iheyensis in the pre-catalytic state. The structure reveals the
conformation of the 5′ splice site in the catalytic core and represents the first structure
of an intron prior to the first step of splicing.
Group II introns are mobile genetic elements that self-splice and retrotranspose into DNA and RNA. They are considered evolutionary ancestors of the spliceosome, the ribonucleoprotein complex essential for pre-mRNA processing in higher eukaryotes. Over a 20-year period, group II introns have been characterized first genetically, then biochemically, and finally by means of X-ray crystallography. To date, 17 crystal structures of a group II intron are available, representing five different stages of the splicing cycle. This review provides a framework for classifying and understanding these new structures in the context of the splicing cycle. Structural and functional implications for the spliceosome are also discussed.
Retrotransposition; Spliceosome; X-ray structures; RNA catalysis; Metal ions
Group II introns are self-splicing ribozymes that share a reaction mechanism and a common ancestor with the eukaryotic spliceosome, thereby providing a model system for understanding the chemistry of pre-mRNA splicing. Here we report fourteen crystal structures of a group II intron at different stages of catalysis. We provide a detailed mechanism for the first step of splicing, we describe a reversible conformational change between the first and the second steps of splicing, and we present the ligand-free intron structure after splicing, in an active state that corresponds to the retrotransposable form of the intron. During each reaction, the reactants are aligned and activated by a heteronuclear four-metal-ion center that contains a metal cluster and obligate monovalent cations, adopting a structural arrangement similar to that of protein endonucleases. Based on our data, we propose a model for the splicing cycle and show that it is applicable to the eukaryotic spliceosome.
The discovery that RNA molecules can fold into complex structures and carry out diverse cellular roles has led to interest in developing tools for modeling RNA tertiary structure. While significant progress has been made in establishing that the RNA backbone is rotameric, few libraries of discrete conformations specifically for use in RNA modeling have been validated. Here, we present six libraries of discrete RNA conformations based on a simplified pseudo-torsional notation of the RNA backbone, comparable to phi and psi in the protein backbone. We evaluate the ability of each library to represent single nucleotide backbone conformations and we show how individual library fragments can be assembled into dinucleotides that are consistent with established RNA backbone descriptors spanning from sugar to sugar. We then use each library to build all-atom models of 20 test folds and we show how the composition of a fragment library can limit model quality. Despite the limitations inherent in using discretized libraries, we find that several hundred discrete fragments can rebuild RNA folds up to 174 nucleotides in length with atomic-level accuracy (<1.5Å RMSD). We anticipate the libraries presented here could easily be incorporated into RNA structural modeling, analysis, or refinement tools.
RNA structure; RNA backbone conformation; RNA fragment library; RNA modeling
Background: RIG-I is an essential innate immune receptor that detects viral RNAs in infected cells.
Results: RIG-I uses distinct subdomains to recognize specific characteristics of viral RNAs.
Conclusion: The 5′-triphosphate is critical for high affinity RIG-I/RNA interaction.
Significance: Characterizing the RIG-I/RNA interface is essential for understanding early stages of immune response against RNA viruses.
RIG-I is a cytoplasmic surveillance protein that contributes to the earliest stages of the vertebrate innate immune response. The protein specifically recognizes 5′-triphosphorylated RNA structures that are released into the cell by viruses, such as influenza and hepatitis C. To understand the energetic basis for viral RNA recognition by RIG-I, we studied the binding of RIG-I domain variants to a family of dsRNA ligands. Thermodynamic analysis revealed that the isolated RIG-I domains each make important contributions to affinity and that they interact using different strategies. Covalent linkage between the domains enhances RNA ligand specificity while reducing overall binding affinity, thereby providing a mechanism for discriminating virus from host RNA.
ATPases; Interferon; RNA Helicase; RNA-Protein Interaction; Viral Immunology; RNA Triphosphate
RCrane is a new tool for the partially automated building of RNA crystallographic models into electron-density maps of low or intermediate resolution. This tool helps crystallographers to place phosphates and bases into electron density and then automatically predicts and builds the detailed all-atom structure of the traced nucleotides.
RNA crystals typically diffract to much lower resolutions than protein crystals. This low-resolution diffraction results in unclear density maps, which cause considerable difficulties during the model-building process. These difficulties are exacerbated by the lack of computational tools for RNA modeling. Here, RCrane, a tool for the partially automated building of RNA into electron-density maps of low or intermediate resolution, is presented. This tool works within Coot, a common program for macromolecular model building. RCrane helps crystallographers to place phosphates and bases into electron density and then automatically predicts and builds the detailed all-atom structure of the traced nucleotides. RCrane then allows the crystallographer to review the newly built structure and select alternative backbone conformations where desired. This tool can also be used to automatically correct the backbone structure of previously built nucleotides. These automated corrections can fix incorrect sugar puckers, steric clashes and other structural problems.
RCrane; RNA model building
Intracellular RIG-I-like receptors (RLRs, including RIG-I, MDA-5, and LGP-2) recognize viral RNAs as pathogen-associated molecular patterns (PAMPs) and initiate an antiviral immune response. To understand the molecular basis of this process, we determined the crystal structure of RIG-I in complex with double-stranded RNA. The dsRNA is sheathed within a network of protein domains that include a conserved “helicase” domain (regions HEL1 and HEL2), a specialized insertion domain (HEL2i), and a C-terminal regulatory domain (CTD). A V-shaped pincer connects HEL2 and the CTD by gripping an α-helical shaft that extends from HEL1. In this way, the pincer coordinates functions of all the domains and couples RNA binding with ATP hydrolysis. RIG-I falls within the Dicer-RIG-I clade of super family 2 of helicases and this structure reveals complex interplay between motor domains, accessory mechanical domains and RNA that has implications for understanding the nanomechanical function this protein family and other ATPases more broadly.
RIG-I; RNA helicase; innate immunity; X-ray crystallography
Mss116 is a Saccharomyces cerevisiae mitochondrial DEAD-box RNA helicase protein essential for efficient in vivo splicing of all group I and II introns and activation of mRNA translation. Catalysis of intron splicing by Mss116 is coupled to its ATPase activity. Knowledge of the kinetic pathway(s) and biochemical intermediates populated during RNA-stimulated Mss116 ATPase is fundamental for defining how Mss116 ATP utilization is linked to in vivo function. We therefore measured the rate and equilibrium constants underlying Mss116 ATP utilization and nucleotide-linked RNA binding. RNA accelerates the Mss116 steady-state ATPase ~7-fold by promoting rate-limiting ATP hydrolysis, such that Pi release becomes (partially) rate-limiting. RNA binding displays strong thermodynamic coupling to the chemical states of the Mss116-bound nucleotide such that Mss116 with bound ADP-Pi binds RNA more strongly than with bound ADP or in the absence of nucleotide. The predominant biochemical intermediate populated during in vivo steady-state cycling is the strong RNA binding, Mss116-ADP-Pi state. Strong RNA binding allows Mss116 to fulfill its biological role in stabilization of group II intron folding intermediates. ATPase cycling allows for transient population of the weak RNA binding, ADP state of Mss116 and linked dissociation from RNA, which is required for the final stages of intron folding. In cases where Mss116 functions as a helicase, the data collectively favor a model in which ATP hydrolysis promotes a weak-to-strong RNA binding transition that disrupts stable RNA duplexes. The subsequent strong-to-weak RNA binding transition associated with Pi release dissociates RNA-Mss116 complexes, regenerating free Mss116.
RNA helicase; ATPase cycle; kinetics; fluorescence correlation spectroscopy (FCS)
Hepatitis C virus NS3-4A is a membrane-bound enzyme complex that exhibits serine protease, RNA helicase, and RNA-stimulated ATPase activities. This enzyme complex is essential for viral genome replication and has been recently implicated in virus particle assembly. To help clarify the role of NS4A in these processes, we conducted alanine scanning mutagenesis on the C-terminal acidic domain of NS4A in the context of a chimeric genotype 2a reporter virus. Of 13 mutants tested, two (Y45A and F48A) had severe defects in replication, while seven (K41A, L44A, D49A, E50A, M51A, E52A, and E53A) efficiently replicated but had severe defects in virus particle assembly. Multiple strategies were used to identify second-site mutations that suppressed these NS4A defects. The replication defect of NS4A F48A was partially suppressed by mutation of NS4B I7F, indicating that a genetic interaction between NS4A and NS4B contributes to RNA replication. Furthermore, the virus assembly defect of NS4A K41A was suppressed by NS3 Q221L, a mutation previously implicated in overcoming other virus assembly defects. We therefore examined the known enzymatic activities of wild-type or mutant forms of NS3-4A but did not detect specific defects in the mutants. Taken together, our data reveal interactions between NS4A and NS4B that control genome replication and between NS3 and NS4A that control virus assembly.
Multiple studies hypothesize that DEAD-box proteins facilitate folding of the ai5γ group II intron. However, these conclusions are generally inferred from splicing kinetics, and not from direct monitoring of DEAD-box protein-facilitated folding of the intron. Using native gel electrophoresis and DMS structural probing, we monitored Mss 116-facilitated folding of ai5γ intron ribozymes and a catalytically active self-splicing RNA containing full length intron and short exons. We found that the protein directly stimulates folding of these RNAs by accelerating formation of the compact near-native state. This process occurs in an ATP-independent manner, although, ATP is required for the protein turnover. As Mss 116 binds RNA non-specifically, most of binding events do not result in the formation of the compact state, and ATP is required for the protein to dissociate from such non-productive complexes and rebind the unfolded RNA. Results obtained from experiments at different concentrations of magnesium ions suggest that Mss 116 stimulates folding of ai5γ ribozymes by promoting the formation of unstable folding intermediates, which is then followed by a cascade of folding events resulting in the formation of the compact near-native state. DMS probing results suggest that the compact state formed in the presence of the protein is identical to the near-native state formed more slowly in its absence. Our results also indicate that Mss 116 does not stabilize the native state of the ribozyme, but that such stabilization results from binding of attached exons.
ribozyme; RNA folding; DEAD-box protein; tertiary structure
Group II introns are self-splicing ribozymes that excise themselves from precursor RNAs and catalyze the joining of flanking exons. Excised introns can behave as parasitic RNA molecules, catalyzing their own insertion into DNA and RNA via a reverse-splicing reaction. Previous studies have identified mechanistic roles for various functional groups located in the catalytic core of the intron and within target molecules. Here we introduce a new method for synthesizing long RNA molecules with a modified nucleotide at the 3′-terminus. This modification allows us to examine the mechanistic role of functional groups adjacent to the reaction nucleophile. During reverse-splicing, the 3′-OH group of the intron terminus attacks the phosphodiester linkage of spliced exon sequences. Here we show that the adjacent 2′-OH group on the intron terminus plays an essential role in activating the nucleophile by stripping away a proton from the 3′-OH and then shuttling it from the active-site.
The autocatalytic group II intron ai5γ from Saccharomyces cerevisiae self-splices under high-salt conditions in vitro, but requires the assistance of the DEAD-box protein Mss116 in vivo and under near-physiological conditions in vitro. Here, we show that Mss116 influences the folding mechanism in several ways. By comparing intron precursor RNAs with long (∼300 nt) and short (∼20 nt) exons, we observe that long exon sequences are a major obstacle for self-splicing in vitro. Kinetic analysis indicates that Mss116 not only mitigates the inhibitory effects of long exons, but also assists folding of the intron core. Moreover, a mutation in conserved Motif III that impairs unwinding activity (SAT → AAA) only affects the construct with long exons, suggesting helicase unwinding during exon unfolding, but not in intron folding. Strong parallels between Mss116 and the related protein Cyt-19 from Neurospora crassa suggest that these proteins form a subclass of DEAD-box proteins that possess a versatile repertoire of diverse activities for resolving the folding problems of large RNAs.
RNA helicases are proteins essential to almost every facet of RNA metabolism, including the gene-silencing pathways that employ small RNAs. A phylogenetically related group of helicases is required for the RNA-silencing mechanism in Caenorhabditis elegans. Dicer-related helicase 3 (DRH-3) is a Dicer-RIG-I family protein that is essential for RNA silencing and germline development in nematodes. Here we performed a biochemical characterization of the ligand binding and catalytic activities of DRH-3 in vitro. We identify signature motifs specific to this family of RNA helicases. We find that DRH-3 binds both single-stranded and double-stranded RNAs with high affinity. However, the ATPase activity of DRH-3 is stimulated only by double-stranded RNA. DRH-3 is a robust RNA-stimulated ATPase with a kcat value of 500/min when stimulated with short RNA duplexes. The DRH-3 ATPase may have allosteric regulation in cis that is controlled by the stoichiometry of double-stranded RNA to enzyme. We observe that the DRH-3 ATPase is stimulated only by duplexes containing RNA, suggesting a role for DRH-3 during or after transcription. Our findings provide clues to the role of DRH-3 during the RNA interference response in vivo.
ATPases; Double-stranded RNA; RNA Helicase; RNA Interference (RNAi); siRNA
The superfamily 2 vaccinia viral helicase nucleoside triphosphate phosphohydrolase-II (NPH-II) exhibits robust RNA helicase activity but typically displays little activity on DNA substrates. NPH-II is thus believed to make primary contacts with backbone residues of an RNA substrate. We report an unusual nucleobase bias, previously unreported in any superfamily 1 or 2 helicase, whereby purines are heavily preferred as components of both RNA and DNA tracking strands. The observed sequence bias allows NPH-II to efficiently unwind a DNA·RNA hybrid containing a purine-rich DNA track derived from the 3′-untranslated region of an early vaccinia gene. These results provide insight into potential biological functions of NPH-II and the role of sequence in targeting NPH-II to appropriate substrates. Furthermore, they demonstrate that in addition to backbone contacts, nucleotide bases play an important role in modulating the behavior of NPH-II. They also establish that processive helicase enzymes can display sequence selectivity.
DNA Helicase; Molecular Motors; Protein-Nucleic Acid Interaction; RNA Helicase; Transcription Termination; DExH Helicase; NPH-II; Helicase Activity; Sequence Dependence
Nonstructural protein 3 (NS3) is an essential replicative component of the hepatitis C virus (HCV) and a member of the DExH/D-box family of proteins. The C-terminal region of NS3 (NS3hel) exhibits RNA-stimulated NTPase and helicase activity, while the N-terminal serine protease domain of NS3 enhances RNA binding and unwinding by NS3hel. The nonstructural protein 4A (NS4A) binds to the NS3 protease domain and serves as an obligate cofactor for NS3 serine protease activity. Given its role in stimulating protease activity, we sought to determine whether NS4A also influences the activity of NS3hel. Here we show that NS4A enhances the ability of NS3hel to bind RNA in the presence of ATP, thereby acting as a cofactor for helicase activity. This effect is mediated by amino acids in the C-terminal acidic domain of NS4A. When these residues are mutated, one observes drastic reductions in ATP-coupled RNA binding and duplex unwinding by NS3. These same mutations are lethal in HCV replicons, thereby establishing in vitro and in vivo that NS4A plays an important role in the helicase mechanism of NS3 and its function in replication.