The interaction of viruses and their prokaryotic hosts shaped the evolution of bacterial and archaeal life. Prokaryotes developed several strategies to evade viral attacks that include restriction modification, abortive infection and CRISPR/Cas systems. These adaptive immune systems found in many Bacteria and most Archaea consist of clustered regularly interspaced short palindromic repeat (CRISPR) sequences and a number of CRISPR associated (Cas) genes (Fig. 1)1-3. Different sets of Cas proteins and repeats define at least three major divergent types of CRISPR/Cas systems 4. The universal proteins Cas1 and Cas2 are proposed to be involved in the uptake of viral DNA that will generate a new spacer element between two repeats at the 5' terminus of an extending CRISPR cluster 5. The entire cluster is transcribed into a precursor-crRNA containing all spacer and repeat sequences and is subsequently processed by an enzyme of the diverse Cas6 family into smaller crRNAs 6-8. These crRNAs consist of the spacer sequence flanked by a 5' terminal (8 nucleotides) and a 3' terminal tag derived from the repeat sequence 9. A repeated infection of the virus can now be blocked as the new crRNA will be directed by a Cas protein complex (Cascade) to the viral DNA and identify it as such via base complementarity10. Finally, for CRISPR/Cas type 1 systems, the nuclease Cas3 will destroy the detected invader DNA 11,12 .
These processes define CRISPR/Cas as an adaptive immune system of prokaryotes and opened a fascinating research field for the study of the involved Cas proteins. The function of many Cas proteins is still elusive and the causes for the apparent diversity of the CRISPR/Cas systems remain to be illuminated. Potential activities of most Cas proteins were predicted via detailed computational analyses. A major fraction of Cas proteins are either shown or proposed to function as endonucleases 4.
Here, we present methods to generate crRNAs and precursor-cRNAs for the study of Cas endoribonucleases. Different endonuclease assays require either short repeat sequences that can directly be synthesized as RNA oligonucleotides or longer crRNA and pre-crRNA sequences that are generated via in vitro T7 RNA polymerase run-off transcription. This methodology allows the incorporation of radioactive nucleotides for the generation of internally labeled endonuclease substrates and the creation of synthetic or mutant crRNAs. Cas6 endonuclease activity is utilized to mature pre-crRNAs into crRNAs with 5'-hydroxyl and a 2',3'-cyclic phosphate termini.
Molecular biology; Issue 67; CRISPR/Cas; endonuclease; in vitro transcription; crRNA; Cas6
CRISPR (clustered regularly interspaced short palindromic repeats) elements and cas (CRISPR-associated) genes are widespread in Bacteria and Archaea. The CRISPR/Cas system operates as a defense mechanism against mobile genetic elements (i.e., viruses or plasmids). Here, we investigate seven CRISPR loci in the genome of the crenarchaeon Thermoproteus tenax that include spacers with significant similarity not only to archaeal viruses but also to T. tenax genes. The analysis of CRISPR RNA (crRNA) transcription reveals transcripts of a length between 50 and 130 nucleotides, demonstrating the processing of larger crRNA precursors. The organization of identified cas genes resembles CRISPR/Cas subtype I-A, and the core cas genes are shown to be arranged on two polycistronic transcripts: cascis (cas4, cas1/2, and csa1) and cascade (csa5, cas7, cas5a, cas3, cas3′, and cas8a2). Changes in the environmental parameters such as UV-light exposure or high ionic strength modulate cas gene transcription. Two reconstitution protocols were established for the production of two discrete multipartite Cas protein complexes that correspond to their operonic gene arrangement. These data provide insights into the specialized mechanisms of an archaeal CRISPR/Cas system and allow selective functional analyses of Cas protein complexes in the future.
The CRISPR-Cas (Clustered Regularly Interspaced Short Palindrome Repeats – CRISPR associated proteins) system provides adaptive immunity in archaea and bacteria. A hallmark of CRISPR-Cas is the involvement of short crRNAs that guide associated proteins in the destruction of invading DNA or RNA. We present three fundamentally distinct processing pathways in the cyanobacterium Synechocystis sp. PCC6803 for a subtype I-D (CRISPR1), and two type III systems (CRISPR2 and CRISPR3), which are located together on the plasmid pSYSA. Using high-throughput transcriptome analyses and assays of transcript accumulation we found all CRISPR loci to be highly expressed, but the individual crRNAs had profoundly varying abundances despite single transcription start sites for each array. In a computational analysis, CRISPR3 spacers with stable secondary structures displayed a greater ratio of degradation products. These structures might interfere with the loading of the crRNAs into RNP complexes, explaining the varying abundancies. The maturation of CRISPR1 and CRISPR2 transcripts depends on at least two different Cas6 proteins. Mutation of gene sll7090, encoding a Cmr2 protein led to the disappearance of all CRISPR3-derived crRNAs, providing in vivo evidence for a function of Cmr2 in the maturation, regulation of expression, Cmr complex formation or stabilization of CRISPR3 transcripts. Finally, we optimized CRISPR repeat structure prediction and the results indicate that the spacer context can influence individual repeat structures.
CRISPR (Clustered, Regularly, Interspaced, Short, Palindromic Repeats) loci provide prokaryotes with an adaptive immunity against viruses and other mobile genetic elements. CRISPR arrays can be transcribed and processed into small crRNA molecules, which are then used by the cell to target the foreign nucleic acid. Since spacers are accumulated by active CRISPR/Cas systems, the sequences of these spacers provide a record of the past "infection history" of the organism.
Here we analyzed all currently known spacers present in archaeal genomes and identified their source by DNA similarity. While nearly 50% of archaeal spacers matched mobile genetic elements, such as plasmids or viruses, several others matched chromosomal genes of other organisms, primarily other archaea. Thus, networks of gene exchange between archaeal species were revealed by the spacer analysis, including many cases of inter-genus and inter-species gene transfer events. Spacers that recognize viral sequences tend to be located further away from the leader sequence, implying that there exists a selective pressure for their retention.
CRISPR spacers provide direct evidence for extensive gene exchange in archaea, especially within genera, and support the current dogma where the primary role of the CRISPR/Cas system is anti-viral and anti-plasmid defense.
Open peer review
This article was reviewed by: Profs. W. Ford Doolittle, John van der Oost, Christa Schleper (nominated by board member Prof. J Peter Gogarten)
CRISPR; Lateral Gene transfer; Horizontal gene transfer; viruses; archaea; competence
Phages are the most abundant biological entities on earth and pose a constant challenge to their bacterial hosts. Thus, bacteria have evolved numerous ‘innate’ mechanisms of defense against phage, such as abortive infection or restriction/modification systems. In contrast, the clustered regularly interspaced short palindromic repeats (CRISPR) systems provide acquired, yet heritable, sequence-specific ‘adaptive’ immunity against phage and other horizontally-acquired elements, such as plasmids. Resistance is acquired following viral infection or plasmid uptake when a short sequence of the foreign genome is added to the CRISPR array. CRISPRs are then transcribed and processed, generally by CRISPR associated (Cas) proteins, into short interfering RNAs (crRNAs), which form part of a ribonucleoprotein complex. This complex guides the crRNA to the complementary invading nucleic acid and targets this for degradation. Recently, there have been rapid advances in our understanding of CRISPR/Cas systems. In this review, we will present the current model(s) of the molecular events involved in both the acquisition of immunity and interference stages and will also address recent progress in our knowledge of the regulation of CRISPR/Cas systems.
phages; plasmids; horizontal gene transfer; CRISPR; Cas; cascade; PAM; crRNA; resistance
CRISPR (Clustered, Regularly, Interspaced, Short, Palindromic Repeats) loci have been shown to provide prokaryotes with an adaptive immunity against viruses and plasmids. CRISPR arrays are transcribed and processed into small CRISPR RNA molecules, which base-pair with invading DNA or RNA and lead to its degradation by CRISPR-associated (Cas) protein complexes. New spacers can be acquired by active CRISPR/Cas systems, and thus the sequences of these spacers provide a record of the past “infection history” of the organism. Recently we used spacer sequences from archaeal genomes to infer gene exchange events among archaeal species and genera and to demonstrate that at least in this domain of life CRISPR indeed has an anti-viral role.
CRISPR; Lateral Gene Transfer; archaea; horizontal gene transfer; viruses
CRISPR/Cas (Clustered Regularly Interspaced Short Palindromic Repeats/CRISPR associated sequences) is a recently discovered prokaryotic defense system against foreign DNA, including viruses and plasmids. CRISPR cassette is transcribed as a continuous transcript (pre-crRNA), which is processed by Cas proteins into small RNA molecules (crRNAs) that are responsible for defense against invading viruses. Experiments in E. coli report that overexpression of cas genes generates a large number of crRNAs, from only few pre-crRNAs.
We here develop a minimal model of CRISPR processing, which we parameterize based on available experimental data. From the model, we show that the system can generate a large amount of crRNAs, based on only a small decrease in the amount of pre-crRNAs. The relationship between the decrease of pre-crRNAs and the increase of crRNAs corresponds to strong linear amplification. Interestingly, this strong amplification crucially depends on fast non-specific degradation of pre-crRNA by an unidentified nuclease. We show that overexpression of cas genes above a certain level does not result in further increase of crRNA, but that this saturation can be relieved if the rate of CRISPR transcription is increased. We furthermore show that a small increase of CRISPR transcription rate can substantially decrease the extent of cas gene activation necessary to achieve a desired amount of crRNA.
The simple mathematical model developed here is able to explain existing experimental observations on CRISPR transcript processing in Escherichia coli. The model shows that a competition between specific pre-crRNA processing and non-specific degradation determines the steady-state levels of crRNA and is responsible for strong linear amplification of crRNAs when cas genes are overexpressed. The model further shows how disappearance of only a few pre-crRNA molecules normally present in the cell can lead to a large (two orders of magnitude) increase of crRNAs upon cas overexpression. A crucial ingredient of this large increase is fast non-specific degradation by an unspecified nuclease, which suggests that a yet unidentified nuclease(s) is a major control element of CRISPR response. Transcriptional regulation may be another important control mechanism, as it can either increase the amount of generated pre-crRNA, or alter the level of cas gene activity.
This article was reviewed by Mikhail Gelfand, Eugene Koonin and L Aravind.
CRISPR/Cas; Transcript processing; Small RNA; CRISPR expression regulation; CRISPR/Cas response
The competition between viruses and hosts is played out in all branches of life. Many prokaryotes have an adaptive immune system termed ‘CRISPR’ (clustered regularly interspaced short palindromic repeats) which is based on the capture of short pieces of viral DNA. The captured DNA is integrated into the genomic DNA of the organism flanked by direct repeats, transcribed and processed to generate crRNA (CRISPR RNA) that is loaded into a variety of effector complexes. These complexes carry out sequence-specific detection and destruction of invading mobile genetic elements. In the present paper, we report the structure and activity of a Cas6 (CRISPR-associated 6) enzyme (Sso1437) from Sulfolobus solfataricus responsible for the generation of unit-length crRNA species. The crystal structure reveals an unusual dimeric organization that is important for the enzyme's activity. In addition, the active site lacks the canonical catalytic histidine residue that has been viewed as an essential feature of the Cas6 family. Although several residues contribute towards catalysis, none is absolutely essential. Coupled with the very low catalytic rate constants of the Cas6 family and the plasticity of the active site, this suggests that the crRNA recognition and chaperone-like activities of the Cas6 family should be considered as equal to or even more important than their role as traditional enzymes.
antiviral defence; Cas6; clustered regularly interspaced short palindromic repeats (CRISPR); ribonuclease; Sulfolobus; CRISPR, clustered regularly interspaced short palindromic repeats; Cas, CRISPR-associated; crRNA, CRISPR RNA; Ni-NTA, Ni2+-nitrilotriacetate; PaCas6f, Pseudomonas aeruginosa Cas6; PfuCas6, Pyrococcus furiosus Cas6; RAMP, repeat-associated mysterious protein; RMSD, root mean square deviation; RRM, RNA-recognition motif; SAD, single-wavelength anomalous dispersion; SsoCas6, Sulfolobus solfataricus Cas6; TBE, Tris/borate/EDTA; TEV, tobacco etch virus; TtCas6e, Thermus thermophilus Cas6
Many prokaryotes contain genomic clustered regularly interspaced short palindromic repeats (CRISPRs) that confer resistance to invasive genetic elements. Central to this immune system is the production of CRISPR-derived RNAs (crRNAs) following transcription of the CRISPR locus. Here we identify the endoribonuclease (Csy4) responsible for pre-crRNA processing in Pseudomonas aeruginosa. A 1.8 Å crystal structure of Csy4 in complex with its cognate RNA reveals an unexpected recognition mechanism whereby Csy4 makes sequence-specific interactions in the major groove of the CRISPR repeat stem-loop. Together with electrostatic contacts to the phosphate backbone, these enable Csy4 to selectively bind and cleave pre-crRNAs. The active site of Csy4 comprises two invariant residues, a serine and a histidine. The RNA recognition mechanism identified here explains sequence- and structure-specific processing by a large family of CRISPR-specific endoribonucleases.
Prokaryotes immunize themselves against transmissible genetic elements by the integration (acquisition) in clustered regularly interspaced short palindromic repeats (CRISPR) loci of spacers homologous to invader nucleic acids, defined as protospacers. Following acquisition, mono-spacer CRISPR RNAs (termed crRNAs) guide CRISPR-associated (Cas) proteins to degrade (interference) protospacers flanked by an adjacent motif in extrachomosomal DNA. During acquisition, selection of spacer-precursors adjoining the protospacer motif and proper orientation of the integrated fragment with respect to the leader (sequence leading transcription of the flanking CRISPR array) grant efficient interference by at least some CRISPR-Cas systems. This adaptive stage of the CRISPR action is poorly characterized, mainly due to the lack of appropriate genetic strategies to address its study and, at least in Escherichia coli, the need of Cas overproduction for insertion detection. In this work, we describe the development and application in Escherichia coli strains of an interference-independent assay based on engineered selectable CRISPR-spacer integration reporter plasmids. By using this tool without the constraint of interference or cas overexpression, we confirmed fundamental aspects of this process such as the critical requirement of Cas1 and Cas2 and the identity of the CTT protospacer motif for the E. coli K12 system. In addition, we defined the CWT motif for a non-K12 CRISPR-Cas variant, and obtained data supporting the implication of the leader in spacer orientation, the preferred acquisition from plasmids harboring cas genes and the occurrence of a sequential cleavage at the insertion site by a ruler mechanism.
CRISPR-spacer acquisition; Cascade; Escherichia coli K12; O157:H7; RNA-guided immunity; cas genes; protospacer adjacent motif; reporter plasmids; ruler mechanism; spacer orientation
To fend off foreign genetic elements, prokaryotes have developed several defense systems. The most recently discovered defense system, CRISPR/Cas, is sequence-specific, adaptive and heritable. The two central components of this system are the Cas proteins and the CRISPR RNA. The latter consists of repeat sequences that are interspersed with spacer sequences. The CRISPR locus is transcribed into a precursor RNA that is subsequently processed into short crRNAs. CRISPR/Cas systems have been identified in bacteria and archaea, and data show that many variations of this system exist. We analyzed the requirements for a successful defense reaction in the halophilic archaeon Haloferax volcanii. Haloferax encodes a CRISPR/Cas system of the I-B subtype, about which very little is known. Analysis of the mature crRNAs revealed that they contain a spacer as their central element, which is preceded by an eight-nucleotide-long 5′ handle that originates from the upstream repeat. The repeat sequences have the potential to fold into a minimal stem loop. Sequencing of the crRNA population indicated that not all of the spacers that are encoded by the three CRISPR loci are present in the same abundance. By challenging Haloferax with an invader plasmid, we demonstrated that the interaction of the crRNA with the invader DNA requires a 10-nucleotide-long seed sequence. In addition, we found that not all of the crRNAs from the three CRISPR loci are effective at triggering the degradation of invader plasmids. The interference does not seem to be influenced by the copy number of the invader plasmid.
archaea; Haloferax volcanii; CRISPR/Cas; crRNA; PAM; seed sequence
Clustered regularly interspaced short palindromic repeats (CRISPRs), together with an operon of CRISPR-associated (Cas) proteins, form an RNA-based prokaryotic immune system against exogenous genetic elements. Cas5 family proteins are found in several Type I CRISPR-Cas systems. Here we report the molecular function of Subtype I-C/Dvulg Cas5d from B. halodurans. We show that Cas5d cleaves pre-crRNA into unit length by recognizing both the hairpin structure and the 3′ single stranded sequence in the CRISPR repeat region. Cas5d structure reveals a ferredoxin domain-based architecture and a catalytic triad formed by Y46, K116 and H117 residues. We further show that after pre-crRNA processing, Cas5d assembles with crRNA, Csd1, and Csd2 proteins to form a multi-subunit interference complex similar to E. coli Cascade (CRISPR-associated complex for antiviral defense) in architecture. Our results suggest that formation of a crRNA-presenting Cascade-like complex is likely a common theme among Type I CRISPR subtypes.
Studies of the Escherichia, Neisseria, Thermotoga, and Mycobacteria clustered regularly interspaced short palindromic repeat (CRISPR) subtypes have resulted in a model whereby CRISPRs function as a defense system against bacteriophage infection and conjugative plasmid transfer. In contrast, we previously showed that the Yersinia-subtype CRISPR region of Pseudomonas aeruginosa strain UCBPP-PA14 plays no detectable role in viral immunity but instead is required for bacteriophage DMS3-dependent inhibition of biofilm formation by P. aeruginosa. The goal of this study is to define the components of the Yersinia-subtype CRISPR region required to mediate this bacteriophage-host interaction. We show that the Yersinia-subtype-specific CRISPR-associated (Cas) proteins Csy4 and Csy2 are essential for small CRISPR RNA (crRNA) production in vivo, while the Csy1 and Csy3 proteins are not absolutely required for production of these small RNAs. Further, we present evidence that the core Cas protein Cas3 functions downstream of small crRNA production and that this protein requires functional HD (predicted phosphohydrolase) and DEXD/H (predicted helicase) domains to suppress biofilm formation in DMS3 lysogens. We also determined that only spacer 1, which is not identical to any region of the DMS3 genome, mediates the CRISPR-dependent loss of biofilm formation. Our evidence suggests that gene 42 of phage DMS3 (DMS3-42) is targeted by CRISPR2 spacer 1 and that this targeting tolerates multiple point mutations between the spacer and DMS3-42 target sequence. This work demonstrates how the interaction between P. aeruginosa strain UCBPP-PA14 and bacteriophage DMS3 can be used to further our understanding of the diverse roles of CRISPR system function in bacteria.
A widespread feature in the genomes of most bacteria and archaea is an array of clustered, regularly interspaced short palindromic repeats (CRISPRs) that, together with a group of CRISPR-associated (Cas) proteins, mediate immunity against invasive nucleic acids such as plasmids and viruses. Here, the CRISPR-Cas system was activated in cells expressing a plasmid-encoded protein that was targeted to the twin-arginine translocation (Tat) pathway. Expression of this Tat substrate resulted in upregulation of the Cas enzymes and subsequent silencing of the encoding plasmid in a manner that required the BaeSR two-component regulatotory system, which is known to respond to extracytoplasmic stress. Further, we confirm that the CasCDE enzymes form a stable ternary complex and appear to function as the catalytic core of the Cas system to process CRISPR RNA into its mature form. Taken together, our results indicate that the CRISPR-Cas system targets DNA directly as part of a defense mechanism in bacteria that is overlapping with but not limited to phage infection.
CRISPR (cluster of regularly interspaced palindromic repeats) is a prokaryotic adaptive defence system, providing immunity against mobile genetic elements such as viruses. Genomically encoded crRNA (CRISPR RNA) is used by Cas (CRISPR-associated) proteins to target and subsequently degrade nucleic acids of invading entities in a sequence-dependent manner. The process is known as ‘interference’. In the present review we cover recent progress on the structural biology of the CRISPR/Cas system, focusing on the Cas proteins and complexes that catalyse crRNA biogenesis and interference. Structural studies have helped in the elucidation of key mechanisms, including the recognition and cleavage of crRNA by the Cas6 and Cas5 proteins, where remarkable diversity at the level of both substrate recognition and catalysis has become apparent. The RNA-binding RAMP (repeat-associated mysterious protein) domain is present in the Cas5, Cas6, Cas7 and Cmr3 protein families and RAMP-like domains are found in Cas2 and Cas10. Structural analysis has also revealed an evolutionary link between the small subunits of the type I and type III-B interference complexes. Future studies of the interference complexes and their constituent components will transform our understanding of the system.
antiviral defence; cluster of regularly interspaced palindromic repeats (CRISPR); crystallography; evolution; protein structure; repeat-associated mysterious protein (RAMP); BhCas5c, Bacillus halodurans Cas5c; CRISPR, cluster of regularly interspaced palindromic repeats; Cas, CRISPR-associated; Cascade, CRISPR-associated complex for antiviral defence; crRNA, CRISPR RNA; dsDNA, double-stranded DNA; EcoCas3, Escherichia coli Cas3; EM, electron microscopy; HD, histidine–aspartate; MjaCas3″, Methanocaldococcus jannaschii Cas3″; PaCas6f, Pseudomonas aeruginosa Cas6f; PAM, protospacer adjacent motif; PfuCas, Pyrococcus furiosus Cas; pre-crRNA, precursor crRNA; RAMP, repeat-associated mysterious protein; RRM, RNA recognition motif; ssDNA, single-stranded DNA; SsoCas, Sulfolobus solfataricus Cas; ssRNA, single-stranded RNA; SthCas3, Streptococcus thermophilus Cas3; tracrRNA, trans-activating crRNA; TtCas, Thermus thermophilus Cas
The adaptive immune system comprising CRISPR (clustered regularly interspaced short palindromic repeats) arrays and cas (CRISPR-associated) genes has been discovered in a wide range of bacteria and archaea and has recently attracted comprehensive investigations. However, the subtype I-B CRISPR-Cas system in haloarchaea has been less characterized. Here, we investigated Cas6-mediated RNA processing in Haloferax mediterranei. The Cas6 cleavage site, as well as the CRISPR transcription start site, was experimentally determined, and processing of CRISPR transcripts was detected with a progressively increasing pattern from early log to stationary phase. With genetic approaches, we discovered that the lack of Cas1, Cas3, or Cas4 unexpectedly resulted in a decrease of CRISPR transcripts, while Cas5, Cas6, and Cas7 were found to be essential in stabilizing mature CRISPR RNA (crRNA). Intriguingly, we observed a CRISPR- and Cas3-independent inhibition of a defective provirus, in which the putative Cascade (CRISPR-associated complex for antiviral defense) proteins (Cas5, Cas6, Cas7, and Cas8b) were indispensably required. A sequence carried by a proviral transcript was found to be homologous to the CRISPR repeat RNA and vulnerable to Cas6-mediated cleavage, implying a distinct interference mechanism that may account for this unusual inhibition. These results provide fundamental information for the subtype I-B CRISPR-Cas system in halophilic archaea and suggest diversified mechanisms and multiple physiological functions for the CRISPR-Cas system.
The prokaryotic Clusters of Regularly Interspaced Palindromic Repeats (CRISPR) system utilizes genomically-encoded CRISPR RNA (crRNA), derived from invading viruses and incorporated into ribonucleoprotein complexes with CRISPR-associated (CAS) proteins, to target and degrade viral DNA or RNA on subsequent infection. RNA is targeted by the CMR complex. In Sulfolobus solfataricus, this complex is composed of seven CAS protein subunits (Cmr1-7) and carries a diverse “payload” of targeting crRNA. The crystal structure of Cmr7 and low resolution structure of the complex are presented. S. solfataricus CMR cleaves RNA targets in an endonucleolytic reaction at UA dinucleotides. This activity is dependent on the 8-nucleotide repeat-derived 5′ sequence in the crRNA, but not on the presence of a proto-spacer associated motif (PAM) in the target. Both target and guide RNAs can be cleaved, although a single molecule of guide RNA can support the degradation of multiple targets.
Discriminating self and non-self is a universal requirement of immune systems. Adaptive immune systems in prokaryotes are centered around repetitive loci called CRISPRs (clustered regularly interspaced short palindromic repeat), into which invader DNA fragments are incorporated. CRISPR transcripts are processed into small RNAs that guide CRISPR-associated (Cas) proteins to invading nucleic acids by complementary base pairing. However, to avoid autoimmunity it is essential that these RNA-guides exclusively target invading DNA and not complementary DNA sequences (i.e., self-sequences) located in the host's own CRISPR locus. Previous work on the Type III-A CRISPR system from Staphylococcus epidermidis has demonstrated that a portion of the CRISPR RNA-guide sequence is involved in self versus non-self discrimination. This self-avoidance mechanism relies on sensing base pairing between the RNA-guide and sequences flanking the target DNA. To determine if the RNA-guide participates in self versus non-self discrimination in the Type I-E system from Escherichia coli we altered base pairing potential between the RNA-guide and the flanks of DNA targets. Here we demonstrate that Type I-E systems discriminate self from non-self through a base pairing-independent mechanism that strictly relies on the recognition of four unchangeable PAM sequences. In addition, this work reveals that the first base pair between the guide RNA and the PAM nucleotide immediately flanking the target sequence can be disrupted without affecting the interference phenotype. Remarkably, this indicates that base pairing at this position is not involved in foreign DNA recognition. Results in this paper reveal that the Type I-E mechanism of avoiding self sequences and preventing autoimmunity is fundamentally different from that employed by Type III-A systems. We propose the exclusive targeting of PAM-flanked sequences to be termed a target versus non-target discrimination mechanism.
CRISPR loci and their associated genes form a diverse set of adaptive immune systems that are widespread among prokaryotes. In these systems, the CRISPR-associated genes (cas) encode for proteins that capture fragments of invading DNA and integrate these sequences between repeat sequences of the host's CRISPR locus. This information is used upon re-infection to degrade invader genomes. Storing invader sequences in host genomes necessitates a mechanism to differentiate between invader sequences on invader genomes and invader sequences on the host genome. CRISPR-Cas of Staphylococcus epidermidis (Type III-A system) is inhibited when invader sequences are flanked by repeat sequences, and this prevents targeting of the CRISPR locus on the host genome. Here we demonstrate that Escherichia coli CRISPR-Cas (Type I-E system) is not inhibited by repeat sequences. Instead, this system is specifically activated by the presence of bona fide Protospacer Adjacent Motifs (PAMs) in the target. PAMs are conserved sequences adjoining invader sequences on the invader genome, and these sequences are never adjacent to invader sequences within host CRISPR loci. PAM recognition is not affected by base pairing potential of the target with the crRNA. As such, the Type I-E system lacks the ability to specifically recognize self DNA.
Viruses that infect bacteria are the most abundant biological agents on the planet and bacteria have evolved diverse defense mechanisms to combat these genetic parasites. One of these bacterial defense systems relies on a repetitive locus, referred to as a CRISPR (clusters of regularly interspaced short palindromic repeats). Bacteria and archaea acquire resistance to invading viruses and plasmids by integrating short fragments of foreign nucleic acids at one end of the CRISPR locus. CRISPR loci are transcribed and the long primary CRISPR transcript is processed into a library of small RNAs that guide the immune system to invading nucleic acids, which are subsequently degraded by dedicated nucleases. However, the development of CRISPR-mediated immune systems has not eradicated phages, suggesting that viruses have evolved mechanisms to subvert CRISPR-mediated protection. Recently, Bondy-Denomy and colleagues discovered several phage-encoded anti-CRISPR proteins that offer new insight into the ongoing molecular arms race between viral parasites and the immune systems of their hosts.
phage; bacterial immunity; RNA-guided immunity; anti-CRISPR; viral suppressors of RNAi (VSR); viral suppressors of CRISPR (VSC)
The CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats) found in prokaryotic genomes confer small RNA-mediated protection against viruses and other invaders. CRISPR loci contain iterations of a short repeat sequence alternating with small segments of varying invader-derived sequences. Distinct families of CRISPR-associated Cas proteins function to cleave within the repeat sequence of CRISPR transcripts and produce the individual invader-targeting crRNAs. Here we report the crystal structure of Pyrococcus furiosus Cas6 bound with a repeat RNA at 3.2 Å resolution. In contrast to other Cas families of endonucleases, Cas6 clasps nucleotides 2–9 of the repeat RNA using its two ferredoxin-like domains, and the enzyme-anchored 5’ end tethers the distal cleavage site of the RNA between nucleotides 22 and 23 to the predicted enzyme active site on the opposite side of the ferrodoxin-like domains. Our findings suggest a wrap-around mechanism for CRISPR RNA recognition and cleavage by Cas6 and related processing endonucleases.
All immune systems must distinguish self from non-self to repel invaders without inducing autoimmunity. Clustered, regularly interspaced, short palindromic repeat (CRISPR) loci protect bacteria and archaea from invasion by phage and plasmid DNA through a genetic interference pathway1–9. CRISPR loci are present in ~ 40% and ~90% of sequenced bacterial and archaeal genomes respectively10 and evolve rapidly, acquiring new spacer sequences to adapt to highly dynamic viral populations1, 11–13. Immunity requires a sequence match between the invasive DNA and the spacers that lie between CRISPR repeats1–9. Each cluster is genetically linked to a subset of the cas (CRISPR-associated) genes14–16 that collectively encode >40 families of proteins involved in adaptation and interference. CRISPR loci encode small CRISPR RNAs (crRNAs) that contain a full spacer flanked by partial repeat sequences2, 17–19. CrRNA spacers are thought to identify targets by direct Watson-Crick pairing with invasive “protospacer” DNA2, 3, but how they avoid targeting the spacer DNA within the encoding CRISPR locus itself is unknown. Here we have defined the mechanism of CRISPR self/non-self discrimination. In Staphylococcus epidermidis, target/crRNA mismatches at specific positions outside of the spacer sequence license foreign DNA for interference, whereas extended pairing between crRNA and CRISPR DNA repeats prevents autoimmunity. Hence, this CRISPR system uses the base-pairing potential of crRNAs not only to specify a target but also to spare the bacterial chromosome from interference. Differential complementarity outside of the spacer sequence is a built-in feature of all CRISPR systems, suggesting that this mechanism is a broadly applicable solution to the self/non-self dilemma that confronts all immune pathways.
Bacteria and Archaea encode clustered, regularly interspaced, short palindromic repeat (CRISPR) systems to confer adaptive immunity to invasive viruses and plasmids. Recent studies of CRISPR systems revealed that diverse CRISPR-associated (Cas) interference modules often coexist in different organisms but functions of cas genes have not been dissected in any of these systems. The crenarchaeon Sulfolobus islandicus encodes three distinct CRISPR interference modules, including a type IA system and two type IIIB systems: Cmr-α and Cmr-β. To study the genetic determinants of protospacer-adjacent motif (PAM)-dependent DNA targeting activity and mature CRISPR RNA (crRNA) production in this organism, mutants deleting individual genes of the type IA system or removing each of other Cas modules were constructed. Characterization of these mutants revealed that Cas7, Cas5, Cas6, Cas3′ and Cas3” are essential for PAM-dependent DNA targeting activity, whereas Csa5, along with all other Cas modules, is dispensable for the targeting in the crenarchaeon. Cas6 is implicated as the only enzyme for pre-crRNA processing and the crRNA maturation is independent of the DNA targeting activity. Importantly, we show that Cas7 and Cas5 are essential for stabilizing the processing intermediates and mature crRNAs, respectively, and that depleting the helicase or nuclease domain of Cas3 leads to the accumulation of processing intermediates. This demonstrates that in addition to Cas6, other Cas proteins of an archaeal type IA system also contribute to crRNA processing.
CRISPR/Cas; Cascade; protospacer-adjacent motifs; crRNA biogenesis; DNA interference; cas mutants; Sulfolobus islandicus
In prokaryotes, clustered regularly interspaced short palindromic repeats (CRISPRs) and their associated (Cas) proteins constitute a defence system against bacteriophages and plasmids. CRISPR/Cas systems acquire short spacer sequences from foreign genetic elements and incorporate these into their CRISPR arrays, generating a memory of past invaders. Defence is provided by short non-coding RNAs that guide Cas proteins to cleave complementary nucleic acids. While most spacers are acquired from phages and plasmids, there are examples of spacers that match genes elsewhere in the host bacterial chromosome. In Pectobacterium atrosepticum the type I-F CRISPR/Cas system has acquired a self-complementary spacer that perfectly matches a protospacer target in a horizontally acquired island (HAI2) involved in plant pathogenicity. Given the paucity of experimental data about CRISPR/Cas–mediated chromosomal targeting, we examined this process by developing a tightly controlled system. Chromosomal targeting was highly toxic via targeting of DNA and resulted in growth inhibition and cellular filamentation. The toxic phenotype was avoided by mutations in the cas operon, the CRISPR repeats, the protospacer target, and protospacer-adjacent motif (PAM) beside the target. Indeed, the natural self-targeting spacer was non-toxic due to a single nucleotide mutation adjacent to the target in the PAM sequence. Furthermore, we show that chromosomal targeting can result in large-scale genomic alterations, including the remodelling or deletion of entire pre-existing pathogenicity islands. These features can be engineered for the targeted deletion of large regions of bacterial chromosomes. In conclusion, in DNA–targeting CRISPR/Cas systems, chromosomal interference is deleterious by causing DNA damage and providing a strong selective pressure for genome alterations, which may have consequences for bacterial evolution and pathogenicity.
Bacteria have evolved mechanisms that provide protection from continual invasion by viruses and other foreign elements. Resistance systems, known as CRISPR/Cas, were recently discovered and equip bacteria and archaea with an “adaptive immune system.” This adaptive immunity provides a highly evolvable sequence-specific small RNA–based memory of past invasions by viruses and foreign genetic elements. There are many cases where these systems appear to target regions within the bacterial host's own genome (a possible autoimmunity), but the evolutionary rationale for this is unclear. Here, we demonstrate that CRISPR/Cas targeting of the host chromosome is highly toxic but that cells survive through mutations that alleviate the immune mechanism. We have used this phenotype to gain insight into how these systems function and show that large changes in the bacterial genome can occur. For example, targeting of a chromosomal pathogenicity island, important for virulence of the potato pathogen Pectobacterium atrosepticum, resulted in deletion of the island, which constituted ∼2% of the bacterial genome. These results have broad significance for the role of CRISPR/Cas systems and their impact on the evolution of bacterial genomes and virulence. In addition, this study demonstrates their potential as a tool for the targeted deletion of specific regions of bacterial chromosomes.
CRISPR-Cas is a rapidly evolving RNA-mediated adaptive immune system that protects bacteria and archaea against mobile genetic elements. The system relies on the activity of short mature CRISPR RNAs (crRNAs) that guide Cas protein(s) to silence invading nucleic acids. A set of CRISPR-Cas, type II, requires a trans-activating small RNA, tracrRNA, for maturation of precursor crRNA (pre-crRNA) and interference with invading sequences. Following co-processing of tracrRNA and pre-crRNA by RNase III, dual-tracrRNA:crRNA guides the CRISPR-associated endonuclease Cas9 (Csn1) to cleave site-specifically cognate target DNA. Here, we screened available genomes for type II CRISPR-Cas loci by searching for Cas9 orthologs. We analyzed 75 representative loci, and for 56 of them we predicted novel tracrRNA orthologs. Our analysis demonstrates a high diversity in cas operon architecture and position of the tracrRNA gene within CRISPR-Cas loci. We observed a correlation between locus heterogeneity and Cas9 sequence diversity, resulting in the identification of various type II CRISPR-Cas subgroups. We validated the expression and co-processing of predicted tracrRNAs and pre-crRNAs by RNA sequencing in five bacterial species. This study reveals tracrRNA family as an atypical, small RNA family with no obvious conservation of structure, sequence or localization within type II CRISPR-Cas loci. The tracrRNA family is however characterized by the conserved feature to base-pair to cognate pre-crRNA repeats, an essential function for crRNA maturation and DNA silencing by dual-RNA:Cas9. The large panel of tracrRNA and Cas9 ortholog sequences should constitute a useful database to improve the design of RNA-programmable Cas9 as genome editing tool.
tracrRNA; CRISPR-Cas; type II system; Cas9 (Csn1); RNA processing; RNA maturation; small non-coding RNA; bacteria; adaptive immunity; mobile genetic elements
Bacteria and archaea face continual onslaughts of rapidly diversifying viruses and plasmids. Many prokaryotes maintain adaptive immune systems known as clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (Cas). CRISPR-Cas systems are genomic sensors that serially acquire viral and plasmid DNA fragments (spacers) that are utilized to target and cleave matching viral and plasmid DNA in subsequent genomic invasions, offering critical immunological memory. Only 50% of sequenced bacteria possess CRISPR-Cas immunity, in contrast to over 90% of sequenced archaea. To probe why half of bacteria lack CRISPR-Cas immunity, we combined comparative genomics and mathematical modeling. Analysis of hundreds of diverse prokaryotic genomes shows that CRISPR-Cas systems are substantially more prevalent in thermophiles than in mesophiles. With sequenced bacteria disproportionately mesophilic and sequenced archaea mostly thermophilic, the presence of CRISPR-Cas appears to depend more on environmental temperature than on bacterial-archaeal taxonomy. Mutation rates are typically severalfold higher in mesophilic prokaryotes than in thermophilic prokaryotes. To quantitatively test whether accelerated viral mutation leads microbes to lose CRISPR-Cas systems, we developed a stochastic model of virus-CRISPR coevolution. The model competes CRISPR-Cas-positive (CRISPR-Cas+) prokaryotes against CRISPR-Cas-negative (CRISPR-Cas−) prokaryotes, continually weighing the antiviral benefits conferred by CRISPR-Cas immunity against its fitness costs. Tracking this cost-benefit analysis across parameter space reveals viral mutation rate thresholds beyond which CRISPR-Cas cannot provide sufficient immunity and is purged from host populations. These results offer a simple, testable viral diversity hypothesis to explain why mesophilic bacteria disproportionately lack CRISPR-Cas immunity. More generally, fundamental limits on the adaptability of biological sensors (Lamarckian evolution) are predicted.
A remarkable recent discovery in microbiology is that bacteria and archaea possess systems conferring immunological memory and adaptive immunity. Clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (CRISPR-Cas) are genomic sensors that allow prokaryotes to acquire DNA fragments from invading viruses and plasmids. Providing immunological memory, these stored fragments destroy matching DNA in future viral and plasmid invasions. CRISPR-Cas systems also provide adaptive immunity, keeping up with mutating viruses and plasmids by continually acquiring new DNA fragments. Surprisingly, less than 50% of mesophilic bacteria, in contrast to almost 90% of thermophilic bacteria and Archaea, maintain CRISPR-Cas immunity. Using mathematical modeling, we probe this dichotomy, showing how increased viral mutation rates can explain the reduced prevalence of CRISPR-Cas systems in mesophiles. Rapidly mutating viruses outrun CRISPR-Cas immune systems, likely decreasing their prevalence in bacterial populations. Thus, viral adaptability may select against, rather than for, immune adaptability in prokaryotes.