Two recent publications have demonstrated how delivering CRISPR nucleases provides a promising solution to the growing problem of bacterial antibiotic resistance.
Shiga toxin-producing Escherichia coli (STEC) strains (n = 194) representing 43 serotypes and E. coli K-12 were examined for clustered regularly interspaced short palindromic repeat (CRISPR) arrays to study genetic relatedness among STEC serotypes. A subset of the strains (n = 81) was further analyzed for subtype I-E cas and virulence genes to determine a possible association of CRISPR elements with potential virulence. Four types of CRISPR arrays were identified. CRISPR1 and CRISPR2 were present in all strains tested; 1 strain also had both CRISPR3 and CRISPR4, whereas 193 strains displayed a short, combined array, CRISPR3-4. A total of 3,353 spacers were identified, representing 528 distinct spacers. The average length of a spacer was 32 bp. Approximately one-half of the spacers (54%) were unique and found mostly in strains of less common serotypes. Overall, CRISPR spacer contents correlated well with STEC serotypes, and identical arrays were shared between strains with the same H type (O26:H11, O103:H11, and O111:H11). There was no association identified between the presence of subtype I-E cas and virulence genes, but the total number of spacers had a negative correlation with potential pathogenicity (P < 0.05). Fewer spacers were found in strains that had a greater probability of causing outbreaks and disease than in those with lower virulence potential (P < 0.05). The relationship between the CRISPR-cas system and potential virulence needs to be determined on a broader scale, and the biological link will need to be established.
Clustered regularly interspaced short palindromic repeats (CRISPR) in combination with associated sequences (cas) constitute the CRISPR-Cas immune system, which uptakes DNA from invasive genetic elements as novel “spacers” that provide a genetic record of immunization events. We investigated the potential of CRISPR-based genotyping of Lactobacillus buchneri, a species relevant for commercial silage, bioethanol, and vegetable fermentations. Upon investigating the occurrence and diversity of CRISPR-Cas systems in Lactobacillus buchneri genomes, we observed a ubiquitous occurrence of CRISPR arrays containing a 36-nucleotide (nt) type II-A CRISPR locus adjacent to four cas genes, including the universal cas1 and cas2 genes and the type II signature gene cas9. Comparative analysis of CRISPR spacer content in 26 L. buchneri pickle fermentation isolates associated with spoilage revealed 10 unique locus genotypes that contained between 9 and 29 variable spacers. We observed a set of conserved spacers at the ancestral end, reflecting a common origin, as well as leader-end polymorphisms, reflecting recent divergence. Some of these spacers showed perfect identity with phage sequences, and many spacers showed homology to Lactobacillus plasmid sequences. Following a comparative analysis of sequences immediately flanking protospacers that matched CRISPR spacers, we identified a novel putative protospacer-adjacent motif (PAM), 5′-AAAA-3′. Overall, these findings suggest that type II-A CRISPR-Cas systems are valuable for genotyping of L. buchneri.
Two recent papers in Science illustrate how the prokaryotic CRISPR-Cas immune system machinery, which typically targets invasive genetic elements such as viruses and plasmids, can be converted into a sophisticated molecular tool for next-generation human genome editing. The versatile Cas9 RNA-guided endonuclease can be readily reprogrammed using customizable small RNAs for sequence-specific single- or double-stranded DNA cleavage.
Many strains of Bifidobacterium animalis subsp. lactis are considered health-promoting probiotic microorganisms and are commonly formulated into fermented dairy foods. Analyses of previously sequenced genomes of B. animalis subsp. lactis have revealed little genetic diversity, suggesting that it is a monomorphic subspecies. However, during a multilocus sequence typing survey of Bifidobacterium, it was revealed that B. animalis subsp. lactis ATCC 27673 gave a profile distinct from that of the other strains of the subspecies. As part of an ongoing study designed to understand the genetic diversity of this subspecies, the genome of this strain was sequenced and compared to other sequenced genomes of B. animalis subsp. lactis and B. animalis subsp. animalis. The complete genome of ATCC 27673 was 1,963,012 bp, contained 1,616 genes and 4 rRNA operons, and had a G+C content of 61.55%. Comparative analyses revealed that the genome of ATCC 27673 contained six distinct genomic islands encoding 83 open reading frames not found in other strains of the same subspecies. In four islands, either phage or mobile genetic elements were identified. In island 6, a novel clustered regularly interspaced short palindromic repeat (CRISPR) locus which contained 81 unique spacers was identified. This type I-E CRISPR-cas system differs from the type I-C systems previously identified in this subspecies, representing the first identification of a different system in B. animalis subsp. lactis. This study revealed that ATCC 27673 is a strain of B. animalis subsp. lactis with novel genetic content and suggests that the lack of genetic variability observed is likely due to the repeated sequencing of a limited number of widely distributed commercial strains.
Evolutionary studies of clustered regularly interspaced short palindromic repeats (CRISPRs) and their associated (cas) genes can provide insights into host-pathogen co-evolutionary dynamics and the frequency at which different genomic events (e.g., horizontal vs. vertical transmission) occur. Within this study, we used whole genome sequence (WGS) data to determine the evolutionary history and genetic diversity of CRISPR loci and cas genes among a diverse set of 427 Salmonella enterica ssp. enterica isolates representing 64 different serovars. We also evaluated the performance of CRISPR loci for typing when compared to whole genome and multilocus sequence typing (MLST) approaches. We found that there was high diversity in array length within both CRISPR1 (median = 22; min = 3; max = 79) and CRISPR2 (median = 27; min = 2; max = 221). There was also much diversity within serovars (e.g., arrays differed by as many as 50 repeat-spacer units among Salmonella ser. Senftenberg isolates). Interestingly, we found that there are two general cas gene profiles that do not track phylogenetic relationships, which suggests that non-vertical transmission events have occurred frequently throughout the evolutionary history of the sampled isolates. There is also considerable variation among the ranges of pairwise distances estimated within each cas gene, which may be indicative of the strength of natural selection acting on those genes. We developed a novel clustering approach based on CRISPR spacer content, but found that typing based on CRISPRs was less accurate than the MLST-based alternative; typing based on WGS data was the most accurate. Notwithstanding cost and accessibility, we anticipate that draft genome sequencing, due to its greater discriminatory power, will eventually become routine for traceback investigations.
Salmonella; Horizontal gene transfer; Evolution; CRISPR; Outbreak; Phylogeny; Whole genome sequencing; Typing
The Shiga toxin-producing Escherichia coli (STEC) strains, including those of O157:H7 and the “big six” serogroups (i.e., serogroups O26, O45, O103, O111, O121, and O145), are a group of pathogens designated food adulterants in the United States. The relatively conserved nature of clustered regularly interspaced short palindromic repeats (CRISPRs) in phylogenetically related E. coli strains makes them potential subtyping markers for STEC detection, and a quantitative PCR (qPCR)-based assay was previously developed for O26:H11, O45:H2, O103:H2, O111:H8, O121:H19, O145:H28, and O157:H7 isolates. To better evaluate the sensitivity and specificity of this qPCR method, the CRISPR loci of 252 O157 and big-six STEC isolates were sequenced and analyzed along with 563 CRISPR1 and 624 CRISPR2 sequences available in GenBank. General conservation of spacer content and order was observed within each O157 and big-six serogroup, validating the qPCR method. Meanwhile, it was found that spacer deletion, the presence of an insertion sequence, and distinct alleles within a serogroup are sources of false-negative reactions. Conservation of CRISPR arrays among isolates expressing the same flagellar antigen, specifically, H7, H2, and H11, suggested that these isolates share an ancestor and provided an explanation for the false positives previously observed in the qPCR results. An analysis of spacer distribution across E. coli strains provided limited evidence for temporal spacer acquisition. Conversely, comparison of CRISPR sequences between strains along the stepwise evolution of O157:H7 from its O55:H7 ancestor revealed that, over this ∼7,000-year span, spacer deletion was the primary force generating CRISPR diversity.
Salmonella enterica subsp. enterica serovar Typhimurium is a leading cause of food-borne salmonellosis in the United States. The number of antibiotic-resistant isolates identified in humans is steadily increasing, suggesting that the spread of antibiotic-resistant strains is a major threat to public health. S. Typhimurium is commonly identified in a wide range of animal hosts, food sources, and environments, but little is known about the factors mediating the spread of antibiotic resistance in this ecologically complex serovar. Previously, we developed a subtyping method, CRISPR–multi-virulence-locus sequence typing (MVLST), which discriminates among strains of several common S. enterica serovars. Here, CRISPR-MVLST identified 22 sequence types within a collection of 76 S. Typhimurium isolates from a variety of animal sources throughout central Pennsylvania. Six of the sequence types were identified in more than one isolate, and we observed statistically significant differences in resistance among these sequence types to 7 antibiotics commonly used in veterinary and human medicine, such as ceftiofur and ampicillin (P < 0.05). Importantly, five of these sequence types were subsequently identified in human clinical isolates, and a subset of these isolates had identical antibiotic resistance patterns, suggesting that these subpopulations are being transmitted through the food system. Therefore, CRISPR-MVLST is a promising subtyping method for monitoring the farm-to-fork spread of antibiotic resistance in S. Typhimurium.
CRISPR (clustered regularly interspaced short palindromic repeats)-Cas (CRISPR-associated) systems in bacteria and archaea employ CRISPR RNAs to specifically recognize the complementary DNA of foreign invaders, leading to sequence-specific cleavage or degradation of the target DNA. Recent work has shown that the accidental or intentional targeting of the bacterial genome is cytotoxic and can lead to cell death. Here, we have demonstrated that genome targeting with CRISPR-Cas systems can be employed for the sequence-specific and titratable removal of individual bacterial strains and species. Using the type I-E CRISPR-Cas system in Escherichia coli as a model, we found that this effect could be elicited using native or imported systems and was similarly potent regardless of the genomic location, strand, or transcriptional activity of the target sequence. Furthermore, the specificity of targeting with CRISPR RNAs could readily distinguish between even highly similar strains in pure or mixed cultures. Finally, varying the collection of delivered CRISPR RNAs could quantitatively control the relative number of individual strains within a mixed culture. Critically, the observed selectivity and programmability of bacterial removal would be virtually impossible with traditional antibiotics, bacteriophages, selectable markers, or tailored growth conditions. Once delivery challenges are addressed, we envision that this approach could offer a novel means to quantitatively control the composition of environmental and industrial microbial consortia and may open new avenues for the development of “smart” antibiotics that circumvent multidrug resistance and differentiate between pathogenic and beneficial microorganisms.
Controlling the composition of microbial populations is a critical aspect in medicine, biotechnology, and environmental cycles. While different antimicrobial strategies, such as antibiotics, antimicrobial peptides, and lytic bacteriophages, offer partial solutions, what remains elusive is a generalized and programmable strategy that can distinguish between even closely related microorganisms and that allows for fine control over the composition of a microbial population. This study demonstrates that RNA-directed immune systems in bacteria and archaea called CRISPR-Cas systems can provide such a strategy. These systems can be employed to selectively and quantitatively remove individual bacterial strains based purely on sequence information, creating opportunities in the treatment of multidrug-resistant infections, the control of industrial fermentations, and the study of microbial consortia.
Salmonella enterica subsp. enterica serovar Newport (S. Newport) is the third most prevalent cause of food-borne salmonellosis. Rapid, efficient, and accurate methods for identification are required to track specific strains of S. Newport during outbreaks. By exploiting the hypervariable nature of virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs), we previously developed a sequence-based subtyping approach, designated CRISPR–multi-virulence-locus sequence typing (CRISPR-MVLST). To demonstrate the applicability of this approach, we analyzed a broad set of S. Newport isolates collected over a 5-year period by using CRISPR-MVLST and pulsed-field gel electrophoresis (PFGE). Among 84 isolates, we defined 38 S. Newport sequence types (NSTs), all of which were novel compared to our previous analyses, and 62 different PFGE patterns. Our data suggest that both subtyping approaches have high discriminatory abilities (>0.95) with a potential for clustering cases with common exposures. Importantly, we found that isolates from closely related NSTs were often similar by PFGE profile as well, further corroborating the applicability of CRISPR-MVLST. In the first full application of CRISPR-MVLST, we analyzed isolates from a recent S. Newport outbreak. In this blinded study, we confirmed the utility of CRISPR-MVLST and were able to distinguish the 10 outbreak isolates, as defined by PFGE and epidemiological data, from a collection of 20 S. Newport isolates. Together, our data show that CRISPR-MVLST could be a complementary approach to PFGE subtyping for S. Newport.
Salmonella enterica subsp. enterica serovars Typhimurium (S. Typhimurium) and Heidelberg (S. Heidelberg) are major causes of foodborne salmonellosis, accounting for a fifth of all annual salmonellosis cases in the United States. Rapid, efficient and accurate methods for identification are required for routine surveillance and to track specific strains during outbreaks. We used Pulsed-field Gel Electrophoresis (PFGE) and a recently developed molecular subtyping approach termed CRISPR-MVLST that exploits the hypervariable nature of virulence genes and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) to subtype clinical S. Typhimurium and S. Heidelberg isolates.
We analyzed a broad set of 175 S. Heidelberg and S. Typhimurium isolates collected over a five-year period. We identified 21 Heidelberg Sequence Types (HSTs) and 37 Typhimurium STs (TSTs) that were represented by 27 and 45 PFGE pulsotypes, respectively, and determined the discriminatory power of each method.
For S. Heidelberg, our data shows that combined typing by both CRISPR-MVLST and PFGE provided a discriminatory power of 0.9213. Importantly, CRISPR-MVLST was able to separate common PFGE patterns such as JF6X01.0022 into distinct STs, thus providing significantly greater discriminatory power. Conversely, we show that subtyping by either CRISPR-MVLST or PFGE independently provides a sufficient discriminatory power (0.9345 and 0.9456, respectively) for S. Typhimurium. Additionally, using isolates from two S. Typhimurium outbreaks, we demonstrate that CRISPR-MVLST provides excellent epidemiologic concordance.
CRISPR subtyping; CRISPR-MVLST; Molecular subtyping; Salmonella Heidelberg; Salmonella Typhimurium
The enteric pathogen Salmonella enterica is one of the leading causes of foodborne illness in the world. The species is extremely diverse, containing more than 2,500 named serovars that are designated for their unique antigen characters and pathogenicity profiles—some are known to be virulent pathogens, while others are not. Questions regarding the evolution of pathogenicity, significance of antigen characters, diversity of clustered regularly interspaced short palindromic repeat (CRISPR) loci, among others, will remain elusive until a strong evolutionary framework is established. We present the first large-scale S. enterica subsp. enterica phylogeny inferred from a new reference-free k-mer approach of gathering single nucleotide polymorphisms (SNPs) from whole genomes. The phylogeny of 156 isolates representing 78 serovars (102 were newly sequenced) reveals two major lineages, each with many strongly supported sublineages. One of these lineages is the S. Typhi group; well nested within the phylogeny. Lineage-through-time analyses suggest there have been two instances of accelerated rates of diversification within the subspecies. We also found that antigen characters and CRISPR loci reveal different evolutionary patterns than that of the phylogeny, suggesting that a horizontal gene transfer or possibly a shared environmental acquisition might have influenced the present character distribution. Our study also shows the ability to extract reference-free SNPs from a large set of genomes and then to use these SNPs for phylogenetic reconstruction. This automated, annotation-free approach is an important step forward for bacterial disease tracking and in efficiently elucidating the evolutionary history of highly clonal organisms.
H antigens; serovar; O antigens; CRISPR; lineage-through-time plot; comparative method
We present the 1,991,830-bp complete genome sequence of Lactobacillus acidophilus strain La-14 (SD-5212). Comparative genomic analysis revealed 99.98% similarity overall to the L. acidophilus NCFM genome. Globally, 111 single nucleotide polymorphisms (SNPs) (95 SNPs, 16 indels) were observed throughout the genome. Also, a 416-bp deletion in the LA14_1146 sugar ABC transporter was identified.
Probiotic bifidobacteria in combination with prebiotic carbohydrates have documented positive effects on human health regarding gastrointestinal disorders and improved immunity, however the selective routes of uptake remain unknown for most candidate prebiotics. The differential transcriptomes of Bifidobacterium animalis subsp. lactis Bl-04, induced by 11 potential prebiotic oligosaccharides were analyzed to identify the genetic loci involved in the uptake and catabolism of α- and β-linked hexoses, and β-xylosides.
The overall transcriptome was modulated dependent on the type of glycoside (galactosides, glucosides or xylosides) utilized. Carbohydrate transporters of the major facilitator superfamily (induced by gentiobiose and β-galacto-oligosaccharides (GOS)) and ATP-binding cassette (ABC) transporters (upregulated by cellobiose, GOS, isomaltose, maltotriose, melibiose, panose, raffinose, stachyose, xylobiose and β-xylo-oligosaccharides) were differentially upregulated, together with glycoside hydrolases from families 1, 2, 13, 36, 42, 43 and 77. Sequence analysis of the identified solute-binding proteins that determine the specificity of ABC transporters revealed similarities in the breadth and selectivity of prebiotic utilization by bifidobacteria.
This study identified the differential gene expression for utilization of potential prebiotics highlighting the extensive capabilities of Bifidobacterium lactis Bl-04 to utilize oligosaccharides. Results provide insights into the ability of this probiotic microbe to utilize indigestible carbohydrates in the human gastrointestinal tract.
Bifidobacterium lactis; Transcriptomics; ABC transporter; GPH transporter; Prebiotics; Glycoside hydrolase
The Cas9-crRNA complex of the Streptococcus thermophilus DGCC7710 CRISPR3-Cas system functions as an RNA-guided endonuclease with crRNA-directed target sequence recognition and protein-mediated DNA cleavage. We show here that an additional RNA molecule, tracrRNA (trans-activating CRISPR RNA), co-purifies with the Cas9 protein isolated from the heterologous E. coli strain carrying the S. thermophilus DGCC7710 CRISPR3-Cas system. We provide experimental evidence that tracrRNA is required for Cas9-mediated DNA interference both in vitro and in vivo. We show that Cas9 specifically promotes duplex formation between the precursor crRNA (pre-crRNA) transcript and tracrRNA, in vitro. Furthermore, the housekeeping RNase III contributes to primary pre-crRNA-tracrRNA duplex cleavage for mature crRNA biogenesis. RNase III, however, is not required in the processing of a short pre-crRNA transcribed from a minimal CRISPR array containing a single spacer. Finally, we show that an in vitro-assembled ternary Cas9-crRNA-tracrRNA complex cleaves DNA. This study further specifies the molecular basis for crRNA-based re-programming of Cas9 to specifically cleave any target DNA sequence for precise genome surgery. The processes for crRNA maturation and effector complex assembly established here will contribute to the further development of the Cas9 re-programmable system for genome editing applications.
CRISPR; DNA silencing; Type II CRISPR-Cas systems
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), together with associated genes (cas), form the CRISPR–cas adaptive immune system, which can provide resistance to viruses and plasmids in bacteria and archaea. Here, we use mathematical models, population dynamic experiments, and DNA sequence analyses to investigate the host–phage interactions in a model CRISPR–cas system, Streptococcus thermophilus DGCC7710 and its virulent phage 2972. At the molecular level, the bacteriophage-immune mutant bacteria (BIMs) and CRISPR–escape mutant phage (CEMs) obtained in this study are consistent with those anticipated from an iterative model of this adaptive immune system: resistance by the addition of novel spacers and phage evasion of resistance by mutation in matching sequences or flanking motifs. While CRISPR BIMs were readily isolated and CEMs generated at high rates (frequencies in excess of 10−6), our population studies indicate that there is more to the dynamics of phage–host interactions and the establishment of a BIM–CEM arms race than predicted from existing assumptions about phage infection and CRISPR–cas immunity. Among the unanticipated observations are: (i) the invasion of phage into populations of BIMs resistant by the acquisition of one (but not two) spacers, (ii) the survival of sensitive bacteria despite the presence of high densities of phage, and (iii) the maintenance of phage-limited communities due to the failure of even two-spacer BIMs to become established in populations with wild-type bacteria and phage. We attribute (i) to incomplete resistance of single-spacer BIMs. Based on the results of additional modeling and experiments, we postulate that (ii) and (iii) can be attributed to the phage infection-associated production of enzymes or other compounds that induce phenotypic phage resistance in sensitive bacteria and kill resistant BIMs. We present evidence in support of these hypotheses and discuss the implications of these results for the ecology and (co)evolution of bacteria and phage.
The evidence that the CRISPR regions of the genomes of archaea and bacteria play a role in the ecology and (co)evolution of these microbes and their viruses is overwhelming: (i) the spacers (variable sequences of 26–72 bp of DNA between the repeats of this region) of these prokaryotes are homologous to the DNA of viruses in their communities; (ii) experimentally, the acquisition and incorporation of spacers of viral DNA can protect these organisms from subsequent infection by these viruses; (iii) experimentally, viruses evade this immunity by mutation in homologous protospacers or protospacer-adjacent motifs (PAMs). Not so clear are the nature and magnitude of the role CRISPR plays in this ecology and evolution. Here, we use mathematical models, experiments with Streptococcus thermophilus and the phage 2972, and DNA sequence analyses to explore the contribution of CRISPR–cas immunity to the ecology and (co)evolution of bacteria and their viruses. The results of this study suggest that the contribution of CRISPR to the ecology of bacteria and phage is more modest and limited, and the conditions for a CRISPR–mediated coevolutionary arms race between these organisms more restrictive, than anticipated from models based on the canonical view of phage infection and CRISPR–cas immunity.
We present the complete genomes of Bifidobacterium animalis subsp. lactis B420 and Bi-07. Comparative genomic analysis with the type strain DSMZ10140 revealed 40 to 55 single nucleotide polymorphisms (SNPs) and an indel in a clustered regularly interspaced short palindromic repeat (CRISPR) locus. These genetic differences provide a molecular basis for strain typing within the two main phylogenetic groups of this monomorphic species.
The broad ecological distribution of L. casei makes it an insightful subject for research on genome evolution and lifestyle adaptation. To explore evolutionary mechanisms that determine genomic diversity of L. casei, we performed comparative analysis of 17 L. casei genomes representing strains collected from dairy, plant, and human sources.
Differences in L. casei genome inventory revealed an open pan-genome comprised of 1,715 core and 4,220 accessory genes. Extrapolation of pan-genome data indicates L. casei has a supragenome approximately 3.2 times larger than the average genome of individual strains. Evidence suggests horizontal gene transfer from other bacterial species, particularly lactobacilli, has been important in adaptation of L. casei to new habitats and lifestyles, but evolution of dairy niche specialists also appears to involve gene decay.
Genome diversity in L. casei has evolved through gene acquisition and decay. Acquisition of foreign genomic islands likely confers a fitness benefit in specific habitats, notably plant-associated niches. Loss of unnecessary ancestral traits in strains collected from bacterial-ripened cheeses supports the hypothesis that gene decay contributes to enhanced fitness in that niche. This study gives the first evidence for a L. casei supragenome and provides valuable insights into mechanisms for genome evolution and lifestyle adaptation of this ecologically flexible and industrially important lactic acid bacterium. Additionally, our data confirm the Distributed Genome Hypothesis extends to non-pathogenic, ecologically flexible species like L. casei.
Lactobacillus casei; Lactic acid bacteria; Comparative genomics; Pan-genome; Supragenome; Evolution; Adaptation
The human gastrointestinal tract can be positively modulated by dietary supplementation of probiotic bacteria in combination with prebiotic carbohydrates. Here differential transcriptomics and functional genomics were used to identify genes in Lactobacillus acidophilus NCFM involved in the uptake and catabolism of 11 potential prebiotic compounds consisting of α- and β- linked galactosides and glucosides. These oligosaccharides induced genes encoding phosphoenolpyruvate-dependent sugar phosphotransferase systems (PTS), galactoside pentose hexuronide (GPH) permease, and ATP-binding cassette (ABC) transporters. PTS systems were upregulated primarily by di- and tri-saccharides such as cellobiose, isomaltose, isomaltulose, panose and gentiobiose, while ABC transporters were upregulated by raffinose, Polydextrose, and stachyose. A single GPH transporter was induced by lactitol and galactooligosaccharides (GOS). The various transporters were associated with a number of glycoside hydrolases from families 1, 2, 4, 13, 32, 36, 42, and 65, involved in the catabolism of various α- and β-linked glucosides and galactosides. Further subfamily specialization was also observed for different PTS-associated GH1 6-phospho-β-glucosidases implicated in the catabolism of gentiobiose and cellobiose. These findings highlight the broad oligosaccharide metabolic repertoire of L. acidophilus NCFM and establish a platform for selection and screening of both probiotic bacteria and prebiotic compounds that may positively influence the gastrointestinal microbiota.
The CRISPR–Cas (clustered regularly interspaced short palindromic repeats–CRISPR-associated proteins) modules are adaptive immunity systems that are present in many archaea and bacteria. These defence systems are encoded by operons that have an extraordinarily diverse architecture and a high rate of evolution for both the cas genes and the unique spacer content. Here, we provide an updated analysis of the evolutionary relationships between CRISPR–Cas systems and Cas proteins. Three major types of CRISPR–Cas system are delineated, with a further division into several subtypes and a few chimeric variants. Given the complexity of the genomic architectures and the extremely dynamic evolution of the CRISPR–Cas systems, a unified classification of these systems should be based on multiple criteria. Accordingly, we propose a `polythetic' classification that integrates the phylogenies of the most common cas genes, the sequence and organization of the CRISPR repeats and the architecture of the CRISPR–cas loci.
The CRISPR/Cas system, comprised of clustered regularly interspaced short palindromic repeats along with their associated (Cas) proteins, protects bacteria and archaea from viral predation and invading nucleic acids. While the mechanism of action for this acquired immunity is currently under investigation, the response of Cas protein expression to phage infection has yet to be elucidated. In this study, we employed shotgun proteomics to measure the global proteome expression in a model system for studying the CRISPR/Cas response in S. thermophilus DGCC7710 infected with phage 2972. Host and viral proteins were simultaneously measured following inoculation at two different multiplicities of infection and across various time points using two-dimensional liquid chromatography tandem mass spectrometry. Thirty-seven out of forty predicted viral proteins were detected, including all proteins of the structural virome and viral effector proteins. In total, 1,013 of 2,079 predicted S. thermophilus proteins were detected, facilitating the monitoring of host protein synthesis changes in response to virus infection. Importantly, Cas proteins from all four CRISPR loci in the S. thermophilus DGCC7710 genome were detected, including loci previously thought to be inactive. Many Cas proteins were found to be constitutively expressed, but several demonstrated increased abundance following infection, including the signature Cas9 proteins from the CRISPR1 and CRISPR3 loci, which are key players in the interference phase of the CRISPR/Cas response. Altogether, these results provide novel insights into the proteomic response of S. thermophilus, specifically CRISPR-associated proteins, upon phage 2972 infection.
Well-studied innate immune systems exist throughout bacteria and archaea, but a more recently discovered genomic locus may offer prokaryotes surprising immunological adaptability. Mediated by a cassette-like genomic locus termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR), the microbial adaptive immune system differs from its eukaryotic immune analogues by incorporating new immunities unidirectionally. CRISPR thus stores genomically recoverable timelines of virus-host coevolution in natural organisms refractory to laboratory cultivation. Here we combined a population genetic mathematical model of CRISPR-virus coevolution with six years of metagenomic sequencing to link the recoverable genomic dynamics of CRISPR loci to the unknown population dynamics of virus and host in natural communities. Metagenomic reconstructions in an acid-mine drainage system document CRISPR loci conserving ancestral immune elements to the base-pair across thousands of microbial generations. This ‘trailer-end conservation’ occurs despite rapid viral mutation and despite rapid prokaryotic genomic deletion. The trailer-ends of many reconstructed CRISPR loci are also largely identical across a population. ‘Trailer-end clonality’ occurs despite predictions of host immunological diversity due to negative frequency dependent selection (kill the winner dynamics). Statistical clustering and model simulations explain this lack of diversity by capturing rapid selective sweeps by highly immune CRISPR lineages. Potentially explaining ‘trailer-end conservation,’ we record the first example of a viral bloom overwhelming a CRISPR system. The polyclonal viruses bloom even though they share sequences previously targeted by host CRISPR loci. Simulations show how increasing random genomic deletions in CRISPR loci purges immunological controls on long-lived viral sequences, allowing polyclonal viruses to bloom and depressing host fitness. Our results thus link documented patterns of genomic conservation in CRISPR loci to an evolutionary advantage against persistent viruses. By maintaining old immunities, selection may be tuning CRISPR-mediated immunity against viruses reemerging from lysogeny or migration.
Most microbes appear unculturable in the laboratory, limiting our knowledge of how virus and prokaryotic host evolve in natural systems. However, a genomic locus found in many prokaryotes, CRISPR, may offer cultivation-independent probes of virus-microbe coevolution. Utilizing nearby genes, CRISPR can serially incorporate short viral and plasmid sequences. These sequences bind and cleave cognate regions in subsequent viral and plasmid insertions, conferring adaptive anti-viral and anti-plasmid immunity. By incorporating sequences undirectionally, CRISPR also provides timelines of virus-prokaryote coevolution. Yet, CRISPR only incorporates 30–80 base-pair viral sequences, leaving incomplete coevolutionary recordings. To reconstruct the missing coevolutionary dynamics shaping natural CRISPRs, we combined metagenomic reconstructions with population-scale mathematical modeling. Capturing rare and rapid sweeps of CRISPR diversity by highly immune lines, mathematical modeling explains why naturally reconstructed CRISPR loci are often largely identical across a population. Both model and experiment further document surprising proliferations of old viral sequences against which hosts had preexisting CRISPR immunity. Due to these deadly blooms of ancestral viral elements, CRISPR's conservation of old immune sequences appears to confer a selective advantage. This may explain the striking immunological memory documented in CRISPR loci, which occurs despite rapid viral mutation and despite rapid deletions in prokaryotic genomes.
Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)—CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)—was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.
Salmonella enterica subsp. enterica is the leading cause of bacterial food-borne disease in the United States. Molecular subtyping methods are powerful tools for tracking the farm-to-fork spread of food-borne pathogens during outbreaks. In order to develop a novel multilocus sequence typing (MLST) scheme for subtyping the major serovars of S. enterica subsp. enterica, the virulence genes sseL and fimH and clustered regularly interspaced short palindromic repeat (CRISPR) loci were sequenced from 171 clinical isolates from nine Salmonella serovars, Salmonella serovars Typhimurium, Enteritidis, Newport, Heidelberg, Javiana, I 4,,12:i:−, Montevideo, Muenchen, and Saintpaul. The MLST scheme using only virulence genes was congruent with serotyping and identified epidemic clones but could not differentiate outbreaks. The addition of CRISPR sequences dramatically improved discriminatory power by differentiating individual outbreak strains/clones. Of particular note, the present MLST scheme provided better discrimination of Salmonella serovar Enteritidis strains than pulsed-field gel electrophoresis (PFGE). This method showed high epidemiologic concordance for all serovars screened except for Salmonella serovar Muenchen. In conclusion, the novel MLST scheme described in the present study accurately differentiated outbreak strains/clones of the major serovars of Salmonella, and therefore, it shows promise for subtyping this important food-borne pathogen during investigations of outbreaks.
The CRISPR/Cas adaptive immune system provides resistance against phages and plasmids in Archaea and Bacteria. CRISPR loci integrate short DNA sequences from invading genetic elements that provide small RNA-mediated interference in subsequent exposure to matching nucleic acids. In Streptococcus thermophilus, it was previously shown that the CRISPR1/Cas system can provide adaptive immunity against phages and plasmids by integrating novel spacers following exposure to these foreign genetic elements that subsequently direct the specific cleavage of invasive homologous DNA sequences. Here, we show that the S. thermophilus CRISPR3/Cas system can be transferred into Escherichia coli and provide heterologous protection against plasmid transformation and phage infection. We show that interference is sequence-specific, and that mutations in the vicinity or within the proto-spacer adjacent motif (PAM) allow plasmids to escape CRISPR-encoded immunity. We also establish that cas9 is the sole cas gene necessary for CRISPR-encoded interference. Furthermore, mutation analysis revealed that interference relies on the Cas9 McrA/HNH- and RuvC/RNaseH-motifs. Altogether, our results show that active CRISPR/Cas systems can be transferred across distant genera and provide heterologous interference against invasive nucleic acids. This can be leveraged to develop strains more robust against phage attack, and safer organisms less likely to uptake and disseminate plasmid-encoded undesirable genetic elements.