RIG-I is a complex multidomain molecule and much work has been done over the past years to address the roles of the individual domains in sensing and signalling. All three domain families of RIG-I i.e. CARDs, SF2 and RD have been found to be involved in determining the specificity of RIG-I for 5’-triphosphate containing dsRNA, suggesting a global structural change in RIG-I from a
signal off into
signal on states. For instance, although CARDs are the signalling module of RIG-I, they are also important for the RNA selectivity of RIG-I. ΔCARD-RIG-I is much more efficiently stimulated
in vitro by the non-phosphorylated dsRNA than wtRIG-I, and looses specificity for 5’-triphosphate-RNA (
Cui et al., 2008). The two main RNA binding sites are located on the SF2 domain and RD. The latter senses the 5’-triphosphate moiety on RNAs. The cysteine rich RDs are a unique hallmark and defining feature of RLRs, although only the RD of RIG-I has been found to bind specifically 5′-triphosphate-RNA (). The RD, first described by Michael Gale and coworkers, is important for the activity of the enzyme
in vivo and
in vitro (
Saito et al., 2007). Structural studies indicated that RDs from RIG-I, MDA5 and LGP2 are fold-related to each other (), but bear little sequence homology to other known protein domains (
Cui et al., 2008;
Li et al., 2009a;
Li et al., 2009b;
Lu et al., 2010a;
Lu et al., 2010b;
Pippig et al., 2009;
Takahasi et al., 2009;
Takahasi et al., 2008;
Wang et al., 2010). However, some structural relatives of RD could be identified in the Protein Data Bank. RD is fold-related to the C-terminal methionine sulfoxide reductase domain of PilB (PDB entry 1L1D) and MSS4, a GDP/GTP exchange factor for small Rab-like GTPases (PDB entry 2FU5). Interestingly, MSS4 stimulates nucleotide exchange by structurally modulating the nucleotide-binding site of the Rab8 GTPase (
Itzen et al., 2006) and it will be interesting to analyse whether MSS4 and RD have a related regulatory effect on their functionally associated NTP binding domain (
Cui et al., 2008).
After it was established that RIG-I RD senses the 5’-triphosphate moiety, further work established that RDs of RIG-I, LGP2 and MDA5 are in fact RNA binding domains with specificity for dsRNA ends (
Li et al., 2009b;
Lu et al., 2010a;
Lu et al., 2010b;
Wang et al., 2010). In the case of RIG-I these are dsRNA ends bearing a 5’-triphosphate moiety, while the RDs of LGP2 and MDA5 bind dsRNA ends with or without 5’-triphosphates. The binding activity of MDA5’s RD has not been analysed in more detail because it shows comparatively weak affinity for RNA, but there is circumstantial evidence that MDA5 and LGP2 might cooperate and also MDA5 binds dsRNA ends (
Takahasi et al., 2009). Structural studies indicate that RDs from all three RLRs possess a three-leafed β-sheet structure with short connecting helices. The three leaves are held together and stabilised by a metal binding site, formed by four invariant cysteines that coordinate a zinc ion. The arrangement of the three leaves creates a shallow groove on the concave side of RDs that carries the main positive electrostatic potential and forms the binding site for RNA ends. Structures of RNA complexes are now available for RIG-I’s RD in complex with 5’-tri/diphosphate dsRNA as well as blunt end dsRNA without phosphate moieties and show how RD’s sense nucleic acids ().
These studies indicated that RDs bind dsRNA with a fairly conserved mode of recognition, although some differences can be observed. The core interaction is formed by recognising predominantly the backbone of the last two (RIG-I) or three (LGP2) bases from the 5’-end, while only the very end of the 3’ contributes to binding. Here, a tighter interaction of LGP2 with the 3’ terminus is observed, while RIG-I appears to bind the 5’-terminus more tightly. The 5’-terminal strand is mainly bound at the core binding site, while the recognition of the RNA end structures (5′-triphosphates, pi-system of the bases, sugar of the 3’-end) is mediated to a substantial extent by a specificity loop that is markedly distinct between RIG-I and LGP2. RIG-I is specifically adapted by having several positively charged residues in this region that are critical for counteracting the negative charge of the triphosphate moiety (
Li et al., 2009b;
Lu et al., 2010a;
Lu et al., 2010b;
Wang et al., 2010). Binding of RNAs containing 5’-tri/diphosphates is somewhat different to recognition of unphosphorylated RNA ends (
Lu et al., 2010a;
Lu et al., 2010b;
Wang et al., 2010). This different recognition could, in addition to a diminished binding energy, add to the enzymes specificity. This is consistent with observations that different ligands differentially affect the overall conformation of RIG-I (
Ranjith-Kumar et al., 2009).
In vivo, RIG-I is found in multimeric complexes as it signals downstream but due to the many proteins involved in such structures, it is unclear whether these multimers present a defined protein complex or are the result of co-localising to viral RNA (
Saito et al., 2007).
In vitro studies found that in the presence of 5’-triphophate-dsRNA as well as unphosphorylated dsRNA, RIG-I forms a homogenous, stable dimer, while in the absence of the ligand, RIG-I is monomeric (
Cui et al., 2008;
Ranjith-Kumar et al., 2009). ΔCARD-RIG-I also shows pppRNA dependent dimerisation (
Cui et al., 2008), suggesting that dimer formation occurs via a CARD-independent mechanism (
Ranjith-Kumar et al., 2009). A low-resolution structure of the ΔCARD-RIG-I dimer was obtained using negative stain electron microscopy (
Ranjith-Kumar et al., 2009). Thus, a correct dimer with multiple protein RNA interactions might be a prerequisite for appropriate signal transduction. It is worth noting, that downstream signalling as well as RNA binding by other members of the RLR-pathway also involves protein dimerization (
Baril et al., 2009;
Murali et al., 2008;
Tang and Wang, 2009). Thus, ligand induced self-association of pattern receptors, followed by association-stimulated activation of signal transduction / effector enzymes could be a central activation mechanism also employed by RLRs.
If PAMPs dimerise RIG-I much like PAMPs dimerise TLRs, why does RIG-I contain a SF2 domain in addition to RDs? There are several possibilities. For instance, the ATPase function could provide improved specificity, because RIG-I has a more difficult sensing task to perform in the RNA rich environment of the cytoplasm than TLRs in the endosome, which is devoid of self nucleic acids. Another function could be direct competition with viral proteins, such as a protein displacement activity on RNA or RNA unwinding. Indeed, some unwinding activity has been demonstrated by RIG-I, but it appears to be less robust than one would expect it to be for a
bona fide helicase (
Takahasi et al., 2008). SF2 enzymes share conserved sequence motifs (somehow often misnamed as “helicase motifs”) that mediate ATP and nucleic acid binding. In general, ATP-binding and –hydrolysis induced conformational changes between two subdomains of the SF2-fold repositions two nucleic acid binding sites on the surface of the SF2 domain that leads to dynamic, differential nucleic acid interactions. Depending on the precise biochemical mechanism this conformational power stroke leads to a directional transport of nucleic acids, unwinding of nucleic acid structures, or grip of nucleic acids. For instance, in NS3 of hepatitis C virus, the ATP cycle directionally advances the enzyme by single base(pair) steps on product ssRNA, leading to unwinding of adjacent nucleic acid structures or displacement of proteins bound to substrate RNA (
Dumont et al., 2006;
Myong et al., 2007).
While multimerisation is probably a key part of the RIG-I activation mechanism, for instance by subsequent multimerisation of IPS1 and downstream factors (
Baril et al., 2009), dimer formation
in vitro does not depend on ATP (
Cui et al., 2008). Based on this, an additional activation step is required for downstream signalling by RIG-I, because RIG-I signalling in general requires intact ATP-binding motifs (
Bamming and Horvath, 2009;
Yoneyama et al., 2004). What is the functional role of the SF domain and its RNA stimulated ATPase activity? A first hint came from the observation that the ATPase activity of the isolated RIG-I SF2 domain is stimulated very efficiently by dsRNA but not by ssRNA (
Cui et al., 2008). Often, “helicases” translocate on the DNA/RNA substrate that activates their ATPase activity, suggesting that RIG-I is a dsRNA translocase. In fact, a robust dsRNA translocation activity of RIG-I could be shown using single-molecule studies (
Myong et al., 2009). While double-stranded RNA translocation activity of both wtRIG-I and ΔCARD-RIG-I are observed, these experiments did not reveal substantial RNA unwinding activity. Interestingly, RIG-I is much slower on a generic dsRNA substrate than ΔCARD-RIG-I, which confirms the inhibitory role of the CARDs. In the presence of 5’-triphosphate, this inhibition is lifted and the enzyme translocates efficiently on dsRNA. Thus, RIG-I integrates both 5’-triphosphate and dsRNA patterns by a cooperation of RD and SF2 domain, consistent with its optimal ligands in cellular assays.
What is the biological relevance of RIG-I translocation activity? RIG-I preferentially binds dsRNA that contain 5’-triphosphate. Both PAMPs are features of many replicating RNA viruses. The dsRNA translocation activity on 5’-triphosphate RNA would serve as a dual layer read out of viral PAMPs. Initial recognition is governed by the 5’-triphosphate. Subsequently, RIG-I forms dimers and starts to translocate on a nearby dsRNA stretch. This translocation might lead to exposed CARDs, thus creating a signalling conformation for downstream interactions (). The signal strength may be related to the amount of time spent in the translocation mode and therefore to the length of RNA. Such a model might explain how RIG-I and MDA5 may differentially read out very long dsRNA regions. However, RIG-I efficiently detects also rather short RNA, so only a local translocation step seems to be required for activity. We currently favor a model where a short translocation by few base pairs switches CARDs into a conformation that is recognised by factors such as Trim25 and IPS1 (). Alternatively, the translocation could be a proofreading activity to enable repeated binging to the 5’-triphosphate moiety. This would explain why mutations in some helicase motifs induces constitutive signalling, however, it does not explain why mutations in other ATPase motifs disrupt signalling (
Bamming and Horvath, 2009). Finally, ATP driven protein translocation on viral dsRNA might effectively interfere with viral proteins by preventing them from binding, blocking their progression, or displacing them, thus actively interfering with viral replication. Such a function might be important for LGP2, because it appears to act upstream of RIG-I and MDA5 for viruses with a tightly packaged genomic RNA and less for viruses with a more loosely packaged genomic RNA. Further studies that correlate the features of the SF2 and its translocation activity with RIG-I dependent interferon stimulation are required.
Signalling by RIG-I also depends on ubiquitylation of CARDs and RD by TRIM25 and RFN135, respectively, presumably manifesting a signalling conformation or providing an additional recognition platform for downstream factors (
Gack et al., 2007;
Oshiumi et al., 2009). Using a reconstituted system with partially purified components, it was also found that K63-linked ubiquitin chains can also stimulate RIG-I mediated signalling and that RIG-I interacts with these chains (
Zeng et al., 2010). A three-layered read out (5′-triphosphate, dsRNA, ubiquitylation/ubiquitin chain association) is presumably necessary to sense virus replication robustly and to avoid premature innate immune responses due to sporadic dsRNA or sporadic 5’-triphosphates on normal cellular RNA. Alternatively, a replicating virus may only be detectable in a very short time window and a variety of viral countermeasures interfere with RIG-I sensing. Thus, it is possible that this short time window needs manifestation i.e. through an ubiquitylation reaction.
Several key questions, however, need to be addressed to fully understand the mechanism of RIG-I signalling. For instance, what are the two signal-off and signal-on conformations of RIG-I, what is the molecular basis for the RIG-I:IPS1 interaction, how does this interaction activate IPS1 or recruit downstream factors. Furthermore, what is the exact molecular role of ubiquitin chains or ubiquitylation? What is the function of the RIG-I translocase activity and how does the translocation reaction lead to IPS1 activation?