PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-25 (1356060)

Clipboard (0)
None

Related Articles

1.  Diversity of putative archaeal RNA viruses in metagenomic datasets of a yellowstone acidic hot spring 
SpringerPlus  2015;4:189.
Two genomic fragments (5,662 and 1,269 nt in size, GenBank accession no. JQ756122 and JQ756123, respectively) of novel, positive-strand RNA viruses that infect archaea were first discovered in an acidic hot spring in Yellowstone National Park (Bolduc et al., 2012). To investigate the diversity of these newly identified putative archaeal RNA viruses, global metagenomic datasets were searched for sequences that were significantly similar to those of the viruses. A total of 3,757 associated reads were retrieved solely from the Yellowstone datasets and were used to assemble the genomes of the putative archaeal RNA viruses. Nine contigs with lengths ranging from 417 to 5,866 nt were obtained, 4 of which were longer than 2,200 nt; one contig was 204 nt longer than JQ756122, representing the longest genomic sequence of the putative archaeal RNA viruses. These contigs revealed more than 50% sequence similarity to JQ756122 or JQ756123 and may be partial or nearly complete genomes of novel genogroups or genotypes of the putative archaeal RNA viruses. Sequence and phylogenetic analyses indicated that the archaeal RNA viruses are genetically diverse, with at least 3 related viral lineages in the Yellowstone acidic hot spring environment.
Electronic supplementary material
The online version of this article (doi:10.1186/s40064-015-0973-z) contains supplementary material, which is available to authorized users.
doi:10.1186/s40064-015-0973-z
PMCID: PMC4405519  PMID: 25918685
Putative archaeal RNA viruses; Sequence assembly; Viral diversity; Yellowstone acidic hot spring
2.  Genome signature analysis of thermal virus metagenomes reveals Archaea and thermophilic signatures 
BMC Genomics  2008;9:420.
Background
Metagenomic analysis provides a rich source of biological information for otherwise intractable viral communities. However, study of viral metagenomes has been hampered by its nearly complete reliance on BLAST algorithms for identification of DNA sequences. We sought to develop algorithms for examination of viral metagenomes to identify the origin of sequences independent of BLAST algorithms. We chose viral metagenomes obtained from two hot springs, Bear Paw and Octopus, in Yellowstone National Park, as they represent simple microbial populations where comparatively large contigs were obtained. Thermal spring metagenomes have high proportions of sequences without significant Genbank homology, which has hampered identification of viruses and their linkage with hosts. To analyze each metagenome, we developed a method to classify DNA fragments using genome signature-based phylogenetic classification (GSPC), where metagenomic fragments are compared to a database of oligonucleotide signatures for all previously sequenced Bacteria, Archaea, and viruses.
Results
From both Bear Paw and Octopus hot springs, each assembled contig had more similarity to other metagenome contigs than to any sequenced microbial genome based on GSPC analysis, suggesting a genome signature common to each of these extreme environments. While viral metagenomes from Bear Paw and Octopus share some similarity, the genome signatures from each locale are largely unique. GSPC using a microbial database predicts most of the Octopus metagenome has archaeal signatures, while bacterial signatures predominate in Bear Paw; a finding consistent with those of Genbank BLAST. When using a viral database, the majority of the Octopus metagenome is predicted to belong to archaeal virus Families Globuloviridae and Fuselloviridae, while none of the Bear Paw metagenome is predicted to belong to archaeal viruses. As expected, when microbial and viral databases are combined, each of the Octopus and Bear Paw metagenomic contigs are predicted to belong to viruses rather than to any Bacteria or Archaea, consistent with the apparent viral origin of both metagenomes.
Conclusion
That BLAST searches identify no significant homologs for most metagenome contigs, while GSPC suggests their origin as archaeal viruses or bacteriophages, indicates GSPC provides a complementary approach in viral metagenomic analysis.
doi:10.1186/1471-2164-9-420
PMCID: PMC2556352  PMID: 18798991
3.  The Characterization of RNA Viruses in Tropical Seawater Using Targeted PCR and Metagenomics 
mBio  2014;5(3):e01210-14.
ABSTRACT
Viruses have a profound influence on the ecology and evolution of plankton, but our understanding of the composition of the aquatic viral communities is still rudimentary. This is especially true of those viruses having RNA genomes. The limited data that have been published suggest that the RNA virioplankton is dominated by viruses with positive-sense, single-stranded (+ss) genomes that have features in common with those of eukaryote-infecting viruses in the order Picornavirales (picornavirads). In this study, we investigated the diversity of the RNA virus assemblages in tropical coastal seawater samples using targeted PCR and metagenomics. Amplification of RNA-dependent RNA polymerase (RdRp) genes from fractions of a buoyant density gradient suggested that the distribution of two major subclades of the marine picornavirads was largely congruent with the distribution of total virus-like RNA, a finding consistent with their proposed dominance. Analyses of the RdRp sequences in the library revealed the presence of many diverse phylotypes, most of which were related only distantly to those of cultivated viruses. Phylogenetic analysis suggests that there were hundreds of unique picornavirad-like phylotypes in one 35-liter sample that differed from one another by at least as much as the differences among currently recognized species. Assembly of the sequences in the metagenome resulted in the reconstruction of six essentially complete viral genomes that had features similar to viruses in the families Bacillarna-, Dicistro-, and Marnaviridae. Comparison of the tropical seawater metagenomes with those from other habitats suggests that +ssRNA viruses are generally the most common types of RNA viruses in aquatic environments, but biases in library preparation remain a possible explanation for this observation.
IMPORTANCE
Marine plankton account for much of the photosynthesis and respiration on our planet, and they influence the cycling of carbon and the distribution of nutrients on a global scale. Despite the fundamental importance of viruses to plankton ecology and evolution, most of the viruses in the sea, and the identities of their hosts, are unknown. This report is one of very few that delves into the genetic diversity within RNA-containing viruses in the ocean. The data expand the known range of viral diversity and shed new light on the physical properties and genetic composition of RNA viruses in the ocean.
doi:10.1128/mBio.01210-14
PMCID: PMC4068258  PMID: 24939887
4.  Analyses of the radiation of birnaviruses from diverse host phyla and of their evolutionary affinities with other double-stranded RNA and positive strand RNA viruses using robust structure-based multiple sequence alignments and advanced phylogenetic methods 
Background
Birnaviruses form a distinct family of double-stranded RNA viruses infecting animals as different as vertebrates, mollusks, insects and rotifers. With such a wide host range, they constitute a good model for studying the adaptation to the host. Additionally, several lines of evidence link birnaviruses to positive strand RNA viruses and suggest that phylogenetic analyses may provide clues about transition.
Results
We characterized the genome of a birnavirus from the rotifer Branchionus plicalitis. We used X-ray structures of RNA-dependent RNA polymerases and capsid proteins to obtain multiple structure alignments that allowed us to obtain reliable multiple sequence alignments and we employed “advanced” phylogenetic methods to study the evolutionary relationships between some positive strand and double-stranded RNA viruses. We showed that the rotifer birnavirus genome exhibited an organization remarkably similar to other birnaviruses. As this host was phylogenetically very distant from the other known species targeted by birnaviruses, we revisited the evolutionary pathways within the Birnaviridae family using phylogenetic reconstruction methods. We also applied a number of phylogenetic approaches based on structurally conserved domains/regions of the capsid and RNA-dependent RNA polymerase proteins to study the evolutionary relationships between birnaviruses, other double-stranded RNA viruses and positive strand RNA viruses.
Conclusions
We show that there is a good correlation between the phylogeny of the birnaviruses and that of their hosts at the phylum level using the RNA-dependent RNA polymerase (genomic segment B) on the one hand and a concatenation of the capsid protein, protease and ribonucleoprotein (genomic segment A) on the other hand. This correlation tends to vanish within phyla. The use of advanced phylogenetic methods and robust structure-based multiple sequence alignments allowed us to obtain a more accurate picture (in terms of probability of the tree topologies) of the evolutionary affinities between double-stranded RNA and positive strand RNA viruses. In particular, we were able to show that there exists a good statistical support for the claims that dsRNA viruses are not monophyletic and that viruses with permuted RdRps belong to a common evolution lineage as previously proposed by other groups. We also propose a tree topology with a good statistical support describing the evolutionary relationships between the Picornaviridae, Caliciviridae, Flaviviridae families and a group including the Alphatetraviridae, Nodaviridae, Permutotretraviridae, Birnaviridae, and Cystoviridae families.
doi:10.1186/1471-2148-13-154
PMCID: PMC3724706  PMID: 23865988
RNA-dependent RNA polymerase; Capsid protein; Double-stranded RNA viruses; Positive strand RNA viruses; Rotifer; Maximum likelihood phylogeny method; Bayesian phylogeny method; Structure-based alignments
5.  Characterization of the Archaeal Thermophile Sulfolobus Turreted Icosahedral Virus Validates an Evolutionary Link among Double-Stranded DNA Viruses from All Domains of Life 
Journal of Virology  2006;80(15):7625-7635.
Icosahedral nontailed double-stranded DNA (dsDNA) viruses are present in all three domains of life, leading to speculation about a common viral ancestor that predates the divergence of Eukarya, Bacteria, and Archaea. This suggestion is supported by the shared general architecture of this group of viruses and the common fold of their major capsid protein. However, limited information on the diversity and replication of archaeal viruses, in general, has hampered further analysis. Sulfolobus turreted icosahedral virus (STIV), isolated from a hot spring in Yellowstone National Park, was the first icosahedral virus with an archaeal host to be described. Here we present a detailed characterization of the components forming this unusual virus. Using a proteomics-based approach, we identified nine viral and two host proteins from purified STIV particles. Interestingly, one of the viral proteins originates from a reading frame lacking a consensus start site. The major capsid protein (B345) was found to be glycosylated, implying a strong similarity to proteins from other dsDNA viruses. Sequence analysis and structural predication of virion-associated viral proteins suggest that they may have roles in DNA packaging, penton formation, and protein-protein interaction. The presence of an internal lipid layer containing acidic tetraether lipids has also been confirmed. The previously presented structural models in conjunction with the protein, lipid, and carbohydrate information reported here reveal that STIV is strikingly similar to viruses associated with the Bacteria and Eukarya domains of life, further strengthening the hypothesis for a common ancestor of this group of dsDNA viruses from all domains of life.
doi:10.1128/JVI.00522-06
PMCID: PMC1563717  PMID: 16840341
6.  Zn2+ Inhibits Coronavirus and Arterivirus RNA Polymerase Activity In Vitro and Zinc Ionophores Block the Replication of These Viruses in Cell Culture 
PLoS Pathogens  2010;6(11):e1001176.
Increasing the intracellular Zn2+ concentration with zinc-ionophores like pyrithione (PT) can efficiently impair the replication of a variety of RNA viruses, including poliovirus and influenza virus. For some viruses this effect has been attributed to interference with viral polyprotein processing. In this study we demonstrate that the combination of Zn2+ and PT at low concentrations (2 µM Zn2+ and 2 µM PT) inhibits the replication of SARS-coronavirus (SARS-CoV) and equine arteritis virus (EAV) in cell culture. The RNA synthesis of these two distantly related nidoviruses is catalyzed by an RNA-dependent RNA polymerase (RdRp), which is the core enzyme of their multiprotein replication and transcription complex (RTC). Using an activity assay for RTCs isolated from cells infected with SARS-CoV or EAV—thus eliminating the need for PT to transport Zn2+ across the plasma membrane—we show that Zn2+ efficiently inhibits the RNA-synthesizing activity of the RTCs of both viruses. Enzymatic studies using recombinant RdRps (SARS-CoV nsp12 and EAV nsp9) purified from E. coli subsequently revealed that Zn2+ directly inhibited the in vitro activity of both nidovirus polymerases. More specifically, Zn2+ was found to block the initiation step of EAV RNA synthesis, whereas in the case of the SARS-CoV RdRp elongation was inhibited and template binding reduced. By chelating Zn2+ with MgEDTA, the inhibitory effect of the divalent cation could be reversed, which provides a novel experimental tool for in vitro studies of the molecular details of nidovirus replication and transcription.
Author Summary
Positive-stranded RNA (+RNA) viruses include many important pathogens. They have evolved a variety of replication strategies, but are unified in the fact that an RNA-dependent RNA polymerase (RdRp) functions as the core enzyme of their RNA-synthesizing machinery. The RdRp is commonly embedded in a membrane-associated replication complex that is assembled from viral RNA, and viral and host proteins. Given their crucial function in the viral replicative cycle, RdRps are key targets for antiviral research. Increased intracellular Zn2+ concentrations are known to efficiently impair replication of a number of RNA viruses, e.g. by interfering with correct proteolytic processing of viral polyproteins. Here, we not only show that corona- and arterivirus replication can be inhibited by increased Zn2+ levels, but also use both isolated replication complexes and purified recombinant RdRps to demonstrate that this effect may be based on direct inhibition of nidovirus RdRps. The combination of protocols described here will be valuable for future studies into the function of nidoviral enzyme complexes.
doi:10.1371/journal.ppat.1001176
PMCID: PMC2973827  PMID: 21079686
7.  A Unique Nodavirus with Novel Features: Mosinovirus Expresses Two Subgenomic RNAs, a Capsid Gene of Unknown Origin, and a Suppressor of the Antiviral RNA Interference Pathway 
Journal of Virology  2014;88(22):13447-13459.
ABSTRACT
Insects are a reservoir for many known and novel viruses. We discovered an unknown virus, tentatively named mosinovirus (MoNV), in mosquitoes from a tropical rainforest region in Côte d'Ivoire. The MoNV genome consists of two segments of positive-sense RNA of 2,972 nucleotides (nt) (RNA 1) and 1,801 nt (RNA 2). Its putative RNA-dependent RNA polymerase shares 43% amino acid identity with its closest relative, that of the Pariacoto virus (family Nodaviridae). Unexpectedly, for the putative capsid protein, maximal pairwise identity of 16% to Lake Sinai virus 2, an unclassified virus with a nonsegmented RNA genome, was found. Moreover, MoNV virions are nonenveloped and about 50 nm in diameter, larger than any of the known nodaviruses. Mature MoNV virions contain capsid proteins of ∼56 kDa, which do not seem to be cleaved from a longer precursor. Northern blot analyses revealed that MoNV expresses two subgenomic RNAs of 580 nt (RNA 3) and 292 nt (RNA 4). RNA 4 encodes a viral suppressor of RNA interference (RNAi) that shares its mechanism with the B2 RNAi suppressor protein of other nodaviruses despite lacking recognizable similarity to these proteins. MoNV B2 binds long double-stranded RNA (dsRNA) and, accordingly, inhibits Dicer-2-mediated processing of dsRNA into small interfering RNAs (siRNAs). Phylogenetic analyses indicate that MoNV is a novel member of the family Nodaviridae that acquired its capsid gene via reassortment from an unknown, distantly related virus beyond the family level.
IMPORTANCE The identification of novel viruses provides important information about virus evolution and diversity. Here, we describe an unknown unique nodavirus in mosquitoes, named mosinovirus (MoNV). MoNV was classified as a nodavirus based on its genome organization and on phylogenetic analyses of the RNA-dependent RNA polymerase. Notably, its capsid gene was acquired from an unknown virus with a distant relationship to nodaviruses. Another remarkable feature of MoNV is that, unlike other nodaviruses, it expresses two subgenomic RNAs (sgRNAs). One of the sgRNAs expresses a protein that counteracts antiviral defense of its mosquito host, whereas the function of the other sgRNA remains unknown. Our results show that complete genome segments can be exchanged beyond the species level and suggest that insects harbor a large repertoire of exceptional viruses.
doi:10.1128/JVI.02144-14
PMCID: PMC4249075  PMID: 25210176
8.  Raw Sewage Harbors Diverse Viral Populations 
mBio  2011;2(5):e00180-11.
ABSTRACT
At this time, about 3,000 different viruses are recognized, but metagenomic studies suggest that these viruses are a small fraction of the viruses that exist in nature. We have explored viral diversity by deep sequencing nucleic acids obtained from virion populations enriched from raw sewage. We identified 234 known viruses, including 17 that infect humans. Plant, insect, and algal viruses as well as bacteriophages were also present. These viruses represented 26 taxonomic families and included viruses with single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), positive-sense ssRNA [ssRNA(+)], and dsRNA genomes. Novel viruses that could be placed in specific taxa represented 51 different families, making untreated wastewater the most diverse viral metagenome (genetic material recovered directly from environmental samples) examined thus far. However, the vast majority of sequence reads bore little or no sequence relation to known viruses and thus could not be placed into specific taxa. These results show that the vast majority of the viruses on Earth have not yet been characterized. Untreated wastewater provides a rich matrix for identifying novel viruses and for studying virus diversity.
Importance At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that the viral universe is far more vast and diverse than previously suspected.
Importance
At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that the viral universe is far more vast and diverse than previously suspected.
doi:10.1128/mBio.00180-11
PMCID: PMC3187576  PMID: 21972239
9.  Chromatin is an ancient innovation conserved between Archaea and Eukarya 
eLife  2012;1:e00078.
The eukaryotic nucleosome is the fundamental unit of chromatin, comprising a protein octamer that wraps ∼147 bp of DNA and has essential roles in DNA compaction, replication and gene expression. Nucleosomes and chromatin have historically been considered to be unique to eukaryotes, yet studies of select archaea have identified homologs of histone proteins that assemble into tetrameric nucleosomes. Here we report the first archaeal genome-wide nucleosome occupancy map, as observed in the halophile Haloferax volcanii. Nucleosome occupancy was compared with gene expression by compiling a comprehensive transcriptome of Hfx. volcanii. We found that archaeal transcripts possess hallmarks of eukaryotic chromatin structure: nucleosome-depleted regions at transcriptional start sites and conserved −1 and +1 promoter nucleosomes. Our observations demonstrate that histones and chromatin architecture evolved before the divergence of Archaea and Eukarya, suggesting that the fundamental role of chromatin in the regulation of gene expression is ancient.
DOI: http://dx.doi.org/10.7554/eLife.00078.001
eLife digest
Single-celled microorganisms called archaea are one of the three domains of cellular life, along with bacteria and eukaryotes. Archaea are similar to bacteria in that they do not have nuclei, but genetically they have more in common with eukaryotes. Archaea are found in a wide range of habitats including the human colon, marshlands, the ocean and extreme environments such as hot springs and salt lakes.
It has been known since the 1990s that the DNA of archaea is wrapped around histones to form complexes that closely resemble the nucleosomes found in eukaryotes, albeit with four rather than eight histone subunits. Nucleosomes are the fundamental units of chromatin, the highly-ordered and compact structure that all the DNA in a cell is packed into. Now we know exactly how many nucleosomes are present in a given cell for some eukaryotes, notably yeast, and to a good approximation we know the position of each nucleosome during a variety of metabolic states and physiological conditions. We can also quantify the nucleosome occupancy, which is measure of the length of time that the nucleosomes spend in contact with the DNA: this is a critical piece of information because it determines the level of access that other proteins, including those that regulate gene expression, have to the DNA. These advances have been driven in large part by advances in technology, notably high-density microarrays for genome wide-studies of nucleosome occupancy, and massively parallel sequencing for direct nucleosome sequencing.
Ammar et al. have used these techniques to explore how the DNA of Haloferax volcanii, a species of archaea that thrives in the hyper-salty waters of the Dead Sea, is organized on a genome-wide basis. Despite some clear differences between the genomes of archaea and eukaryotes—for example, genomic DNA is typically circular in archaea and linear in eukaryotes—they found that the genome of Hfx. volcanii is organized into chromatin in a way that is remarkably similar to that seen in all eukaryotic genomes studied to date. This is surprising given that the chromatin in eukaryotes is confined to the nucleus, whereas there are no such constraints in archaea. In particular, Ammar et al. found that those regions of the DNA near the ends of genes that mark where the transcription of the DNA into RNA should begin and end contain have lower nucleosome occupancy than other regions. Moreover, the overall level of occupancy in Hfx. volcanii was twice that of eukaryotes, which is what one would expect given that nucleosomes in archaea contain half as many histone subunits as nucleosomes in eukaryotes. Ammar et al. also confirmed that that the degree of nucleosome occupancy is correlated with gene expression.
These two findings—the similarities between the chromatin in archaea and eukaryotes, and the correlation between nucleosome occupancy and gene expression in archaea—raise an interesting evolutionary possibility: the initial function of nucleosomes and chromatin formation might have been for the regulation of gene expression rather than the packaging of DNA. This is consistent with two decades of research that has shown that there is an extraordinary and complex relationship between the structure of chromatin and the process of gene expression. It is possible, therefore, that as the early eukaryotes evolved, nucleosomes and chromatin started to package DNA into compact structures that, among other things, helped to prevent DNA damage, and that this subsequently enabled the early eukaryotes to flourish.
DOI: http://dx.doi.org/10.7554/eLife.00078.002
doi:10.7554/eLife.00078
PMCID: PMC3510453  PMID: 23240084
Haloferax volcanii; Nucleosome; Chromatin; Transcriptome; RNA-seq; Archaea; Other
10.  Three Novel Virophage Genomes Discovered from Yellowstone Lake Metagenomes 
Journal of Virology  2014;89(2):1278-1285.
ABSTRACT
Virophages are a unique group of circular double-stranded DNA viruses that are considered parasites of giant DNA viruses, which in turn are known to infect eukaryotic hosts. In this study, the genomes of three novel Yellowstone Lake virophages (YSLVs)—YSLV5, YSLV6, and YSLV7—were identified from Yellowstone Lake through metagenomic analyses. The relative abundance of these three novel virophages and previously identified Yellowstone Lake virophages YSLV1 to -4 were determined in different locations of the lake, revealing that most of the sampled locations in the lake, including both mesophilic and thermophilic habitats, had multiple virophage genotypes. This likely reflects the diverse habitats or diversity of the eukaryotic hosts and their associated giant viruses that serve as putative hosts for these virophages. YSLV5 has a 29,767-bp genome with 32 predicted open reading frames (ORFs), YSLV6 has a 24,837-bp genome with 29 predicted ORFs, and YSLV7 has a 23,193-bp genome with 26 predicted ORFs. Based on multilocus phylogenetic analysis, YSLV6 shows a close evolutionary relationship with YSLV1 to -4, whereas YSLV5 and YSLV7 are distantly related to the others, and YSLV7 represents the fourth novel virophage lineage. In addition, the genome of YSLV5 has a G+C content of 51.1% that is much higher than all other known virophages, indicating a unique host range for YSLV5. These results suggest that virophages are abundant and have diverse genotypes that likely mirror diverse giant viral and eukaryotic hosts within the Yellowstone Lake ecosystem.
IMPORTANCE This study discovered novel virophages present within the Yellowstone Lake ecosystem using a conserved major capsid protein as a phylogenetic anchor for assembly of sequence reads from Yellowstone Lake metagenomic samples. The three novel virophage genomes (YSLV5 to -7) were completed by identifying specific environmental samples containing these respective virophages, and closing gaps by targeted PCR and sequencing. Most of the YSLV genotypes were associated primarily with photic-zone and nonhydrothermal samples; however, YSLV5 had a unique distribution with an occurrence in vent samples similar to that in photic-zone samples and with a higher GC content that suggests a distinct host and habitat compared to other YSLVs. In addition, genome content and phylogenetic analyses indicate that YSLV5 and YSLV7 are distinct from known virophages and that additional as-yet-uncharacterized virophages are likely present within the Yellowstone Lake ecosystem.
doi:10.1128/JVI.03039-14
PMCID: PMC4300641  PMID: 25392206
11.  Assembly of Viral Metagenomes from Yellowstone Hot Springs▿ † 
Applied and Environmental Microbiology  2008;74(13):4164-4174.
Thermophilic viruses were reported decades ago; however, knowledge of their diversity, biology, and ecological impact is limited. Previous research on thermophilic viruses focused on cultivated strains. This study examined metagenomic profiles of viruses directly isolated from two mildly alkaline hot springs, Bear Paw (74°C) and Octopus (93°C). Using a new method for constructing libraries from picograms of DNA, nearly 30 Mb of viral DNA sequence was determined. In contrast to previous studies, sequences were assembled at 50% and 95% identity, creating composite contigs up to 35 kb and facilitating analysis of the inherent heterogeneity in the populations. Lowering the assembly identity reduced the estimated number of viral types from 1,440 and 1,310 to 548 and 283, respectively. Surprisingly, the diversity of viral species in these springs approaches that in moderate-temperature environments. While most known thermophilic viruses have a chronic, nonlytic infection lifestyle, analysis of coding sequences suggests lytic viruses are more common in geothermal environments than previously thought. The 50% assembly included one contig with high similarity and perfect synteny to nine genes from Pyrobaculum spherical virus (PSV). In fact, nearly all the genes of the 28-kb genome of PSV have apparent homologs in the metagenomes. Similarities to thermoacidophilic viruses isolated on other continents were limited to specific open reading frames but were equally strong. Nearly 25% of the reads showed significant similarity between the hot springs, suggesting a common subterranean source. To our knowledge, this is the first application of metagenomics to viruses of geothermal origin.
doi:10.1128/AEM.02598-07
PMCID: PMC2446518  PMID: 18441115
12.  The complete genomes of three viruses assembled from shotgun libraries of marine RNA virus communities 
Virology Journal  2007;4:69.
Background
RNA viruses have been isolated that infect marine organisms ranging from bacteria to whales, but little is known about the composition and population structure of the in situ marine RNA virus community. In a recent study, the majority of three genomes of previously unknown positive-sense single-stranded (ss) RNA viruses were assembled from reverse-transcribed whole-genome shotgun libraries. The present contribution comparatively analyzes these genomes with respect to representative viruses from established viral taxa.
Results
Two of the genomes (JP-A and JP-B), appear to be polycistronic viruses in the proposed order Picornavirales that fall into a well-supported clade of marine picorna-like viruses, the characterized members of which all infect marine protists. A temporal and geographic survey indicates that the JP genomes are persistent and widespread in British Columbia waters. The third genome, SOG, encodes a putative RNA-dependent RNA polymerase (RdRp) that is related to the RdRp of viruses in the family Tombusviridae, but the remaining SOG sequence has no significant similarity to any sequences in the NCBI database.
Conclusion
The complete genomes of these viruses permitted analyses that resulted in a more comprehensive comparison of these pathogens with established taxa. For example, in concordance with phylogenies based on the RdRp, our results support a close homology between JP-A and JP-B and RsRNAV. In contrast, although classification of the SOG genome based on the RdRp places SOG within the Tombusviridae, SOG lacks a capsid and movement protein conserved within this family and SOG is thus likely more distantly related to the Tombusivridae than the RdRp phylogeney indicates.
doi:10.1186/1743-422X-4-69
PMCID: PMC1948888  PMID: 17617913
13.  Use of Cellular CRISPR (Clusters of Regularly Interspaced Short Palindromic Repeats) Spacer-Based Microarrays for Detection of Viruses in Environmental Samples ▿ †  
Applied and Environmental Microbiology  2010;76(21):7251-7258.
It is currently difficult to detect unknown viruses in any given environment. The recent discovery of CRISPR (clusters of regularly interspaced short palindromic repeats) loci within bacterial and archaeal cellular genomes may provide an alternative approach to detect new viruses. It has been shown that the spacer sequences between the direct repeat units of the CRISPR loci are often derived from viruses and likely function as guide sequences to protect the cell from viral infection. The spacer sequences within the CRISPR loci may therefore serve as a record of the viruses that have replicated within the cell. We have cataloged the CRISPR spacer sequences from cellular metagenomic data from high-temperature (>80°C), acidic (pH < 4) hot spring environments located in Yellowstone National Park (YNP). We designed a microarray platform utilizing these CRISPR spacer sequences as potential probes to detect viruses present in YNP hot spring environments. We show that this microarray approach can detect viral sequences directly from virus-enriched environmental samples, detecting new viruses which have not been previously characterized. We further demonstrated that this microarray approach can be used to examine temporal changes in viral populations within the environment. Our results demonstrate that CRISPR spacer sequence-based microarrays will be useful tools for detecting and monitoring viruses from diverse environmental samples.
doi:10.1128/AEM.01109-10
PMCID: PMC2976250  PMID: 20851987
14.  Metagenomes from High-Temperature Chemotrophic Systems Reveal Geochemical Controls on Microbial Community Structure and Function 
PLoS ONE  2010;5(3):e9773.
The Yellowstone caldera contains the most numerous and diverse geothermal systems on Earth, yielding an extensive array of unique high-temperature environments that host a variety of deeply-rooted and understudied Archaea, Bacteria and Eukarya. The combination of extreme temperature and chemical conditions encountered in geothermal environments often results in considerably less microbial diversity than other terrestrial habitats and offers a tremendous opportunity for studying the structure and function of indigenous microbial communities and for establishing linkages between putative metabolisms and element cycling. Metagenome sequence (14–15,000 Sanger reads per site) was obtained for five high-temperature (>65°C) chemotrophic microbial communities sampled from geothermal springs (or pools) in Yellowstone National Park (YNP) that exhibit a wide range in geochemistry including pH, dissolved sulfide, dissolved oxygen and ferrous iron. Metagenome data revealed significant differences in the predominant phyla associated with each of these geochemical environments. Novel members of the Sulfolobales are dominant in low pH environments, while other Crenarchaeota including distantly-related Thermoproteales and Desulfurococcales populations dominate in suboxic sulfidic sediments. Several novel archaeal groups are well represented in an acidic (pH 3) Fe-oxyhydroxide mat, where a higher O2 influx is accompanied with an increase in archaeal diversity. The presence or absence of genes and pathways important in S oxidation-reduction, H2-oxidation, and aerobic respiration (terminal oxidation) provide insight regarding the metabolic strategies of indigenous organisms present in geothermal systems. Multiple-pathway and protein-specific functional analysis of metagenome sequence data corroborated results from phylogenetic analyses and clearly demonstrate major differences in metabolic potential across sites. The distribution of functional genes involved in electron transport is consistent with the hypothesis that geochemical parameters (e.g., pH, sulfide, Fe, O2) control microbial community structure and function in YNP geothermal springs.
doi:10.1371/journal.pone.0009773
PMCID: PMC2841643  PMID: 20333304
15.  Rescue of Infectious Birnavirus from Recombinant Ribonucleoprotein Complexes 
PLoS ONE  2014;9(1):e87790.
Birnaviruses are unconventional members of the icosahedral double-stranded (dsRNA) RNA virus group. The main differential birnavirus trait is the lack of the inner icosahedral transcriptional core, a ubiquitous structure conserved in all other icosahedral dsRNA viruses, that shelters the genome from cellular dsRNA sensors and provide the enzymatic machinery to produce and extrude mature messenger RNAs. In contrast, birnaviral particles enclose ribonucleoprotein (RNP) complexes formed by the genome segments, the dsRNA-binding VP3 polypeptide and the virus-encoded RNA polymerase (RdRp). The presence of RNPs suggests that the birnavirus replication program might exhibit significant differences with respect to those of prototypal dsRNA viruses. However, experimental evidences supporting this hypothesis are as yet scarce. Of particular relevance for the understanding of birnavirus replication is to determine whether RNPs act as intracellular capsid-independent transcriptional units. Our study was focused to answer this question using the infectious bursal disease virus (IBDV), the best characterized birnavirus, as model virus. Here, we describe the intracellular assembly of functional IBDV RNPs in the absence of the virus-encoded VP2 capsid polypeptide. Recombinant RNPs are generated upon coexpression of the IBDV VP1 and RdRp polypeptides and transfection of purified virus dsRNA. Presented data show that recombinant RNPs direct the expression of the IBDV polypeptide repertoire and the production of infectious virus in culture cells. Results described in this report constitute the first direct experimental evidence showing that birnaviral RNPs are intracellularly active in the absence of the virus capsid. This finding is consistent with presented data indicating that RNP formation precedes virus assembly in IBDV-infected cells, and supports the recently proposed IBDV replication model entailing the release of RNPs during the initial stages of the infection. Indeed, results presented here also support the previously proposed evolutionary connection between birnaviruses and positive-strand single-stranded RNA viruses.
doi:10.1371/journal.pone.0087790
PMCID: PMC3907549  PMID: 24498196
16.  Temporal order of evolution of DNA replication systems inferred by comparison of cellular and viral DNA polymerases 
Biology Direct  2006;1:39.
Background
The core enzymes of the DNA replication systems show striking diversity among cellular life forms and more so among viruses. In particular, and counter-intuitively, given the central role of DNA in all cells and the mechanistic uniformity of replication, the core enzymes of the replication systems of bacteria and archaea (as well as eukaryotes) are unrelated or extremely distantly related. Viruses and plasmids, in addition, possess at least two unique DNA replication systems, namely, the protein-primed and rolling circle modalities of replication. This unexpected diversity makes the origin and evolution of DNA replication systems a particularly challenging and intriguing problem in evolutionary biology.
Results
I propose a specific succession for the emergence of different DNA replication systems, drawing argument from the differences in their representation among viruses and other selfish replicating elements. In a striking pattern, the DNA replication systems of viruses infecting bacteria and eukaryotes are dominated by the archaeal-type B-family DNA polymerase (PolB) whereas the bacterial replicative DNA polymerase (PolC) is present only in a handful of bacteriophage genomes. There is no apparent mechanistic impediment to the involvement of the bacterial-type replication machinery in viral DNA replication. Therefore, I hypothesize that the observed, markedly unequal distribution of the replicative DNA polymerases among the known cellular and viral replication systems has a historical explanation. I propose that, among the two types of DNA replication machineries that are found in extant life forms, the archaeal-type, PolB-based system evolved first and had already given rise to a variety of diverse viruses and other selfish elements before the advent of the bacterial, PolC-based machinery. Conceivably, at that stage of evolution, the niches for DNA-viral reproduction have been already filled with viruses replicating with the help of the archaeal system, and viruses with the bacterial system never took off. I further suggest that the two other systems of DNA replication, the rolling circle mechanism and the protein-primed mechanism, which are represented in diverse selfish elements, also evolved prior to the emergence of the bacterial replication system. This hypothesis is compatible with the distinct structural affinities of PolB, which has the palm-domain fold shared with reverse transcriptases and RNA-dependent RNA polymerases, and PolC that has a distinct, unrelated nucleotidyltransferase fold. I propose that PolB is a descendant of polymerases that were involved in the replication of genetic elements in the RNA-protein world, prior to the emergence of DNA replication. By contrast, PolC might have evolved from an ancient non-templated polymerase, e.g., polyA polymerase. The proposed temporal succession of the evolving DNA replication systems does not depend on the specific scenario adopted for the evolution of cells and viruses, i.e., whether viruses are derived from cells or virus-like elements are thought to originate from a primordial gene pool. However, arguments are presented in favor of the latter scenario as the most parsimonious explanation of the evolution of DNA replication systems.
Conclusion
Comparative analysis of the diversity of genomic strategies and organizations of viruses and cellular life forms has the potential to open windows into the deep past of life's evolution, especially, with the regard to the origin of genome replication systems. When complemented with information on the evolution of the relevant protein folds, this comparative approach can yield credible scenarios for very early steps of evolution that otherwise appear to be out of reach.
Reviewers
Eric Bapteste, Patrick Forterre, and Mark Ragan.
doi:10.1186/1745-6150-1-39
PMCID: PMC1766352  PMID: 17176463
17.  Comparative Genomic Analysis of Hyperthermophilic Archaeal Fuselloviridae Viruses 
Journal of Virology  2004;78(4):1954-1961.
The complete genome sequences of two Sulfolobus spindle-shaped viruses (SSVs) from acidic hot springs in Kamchatka (Russia) and Yellowstone National Park (United States) have been determined. These nonlytic temperate viruses were isolated from hyperthermophilic Sulfolobus hosts, and both viruses share the spindle-shaped morphology characteristic of the Fuselloviridae family. These two genomes, in combination with the previously determined SSV1 genome from Japan and the SSV2 genome from Iceland, have allowed us to carry out a phylogenetic comparison of these geographically distributed hyperthermal viruses. Each virus contains a circular double-stranded DNA genome of ∼15 kbp with approximately 34 open reading frames (ORFs). These Fusellovirus ORFs show little or no similarity to genes in the public databases. In contrast, 18 ORFs are common to all four isolates and may represent the minimal gene set defining this viral group. In general, ORFs on one half of the genome are colinear and highly conserved, while ORFs on the other half are not. One shared ORF among all four genomes is an integrase of the tyrosine recombinase family. All four viral genomes integrate into their host tRNA genes. The specific tRNA gene used for integration varies, and one genome integrates into multiple loci. Several unique ORFs are found in the genome of each isolate.
doi:10.1128/JVI.78.4.1954-1961.2004
PMCID: PMC369504  PMID: 14747560
18.  Unprecedented genomic diversity of RNA viruses in arthropods reveals the ancestry of negative-sense RNA viruses 
eLife  null;4:e05378.
Although arthropods are important viral vectors, the biodiversity of arthropod viruses, as well as the role that arthropods have played in viral origins and evolution, is unclear. Through RNA sequencing of 70 arthropod species we discovered 112 novel viruses that appear to be ancestral to much of the documented genetic diversity of negative-sense RNA viruses, a number of which are also present as endogenous genomic copies. With this greatly enriched diversity we revealed that arthropods contain viruses that fall basal to major virus groups, including the vertebrate-specific arenaviruses, filoviruses, hantaviruses, influenza viruses, lyssaviruses, and paramyxoviruses. We similarly documented a remarkable diversity of genome structures in arthropod viruses, including a putative circular form, that sheds new light on the evolution of genome organization. Hence, arthropods are a major reservoir of viral genetic diversity and have likely been central to viral evolution.
DOI: http://dx.doi.org/10.7554/eLife.05378.001
eLife digest
Many illnesses, including influenza, hemorrhagic fever, and rabies, are caused by a group of viruses called negative-sense RNA viruses. The genetic information—or genome—of these viruses is encoded in strands of RNA that must be copied before they can be translated into the proteins needed to build new viruses. It is currently known that there are at least eight different families of these viruses, which have a wide range of shapes and sizes and arrange their RNA in different ways.
Insects, spiders, and other arthropods carry many different RNA viruses. Many of these viruses have not previously been studied, and those that have been studied so far are mainly those that cause diseases in humans and other vertebrates. Researchers therefore only know a limited amount about the diversity of the negative-sense RNA viruses that arthropods harbor and how these viruses evolved. Studying how viruses evolve helps scientists to understand what makes some viruses deadly and others harmless and can also help develop treatments or vaccines for the diseases caused by the viruses.
Li, Shi, Tian, Lin, Kang et al. collected 70 species of insects, spiders, centipedes, and other arthropods in China and sequenced all the negative-sense RNA viruses in the creatures. This revealed an enormous number of negative-sense RNA viruses, including 112 new viruses. Many of the newly discovered arthropod viruses appear to be the ancestors of disease-causing viruses, including influenza viruses and the filoviruses—the group that includes the Ebola virus. Indeed, it appears that arthropods host many—if not all—of the negative-sense RNA viruses that cause disease in vertebrates and plants.
While documenting the new RNA viruses and how they are related to each other, Li et al. found many different genome structures. Some genomes were segmented, which may play an important role in evolution as segments can be easily swapped to create new genetic combinations. Non-segmented and circular genomes were also found. This genetic diversity suggests that arthropods are likely to have played a key role in the evolution of new viruses by acting as a site where many different viruses can interact and exchange genetic information.
DOI: http://dx.doi.org/10.7554/eLife.05378.002
doi:10.7554/eLife.05378
PMCID: PMC4384744  PMID: 25633976
RNA virus; evolution; arthropods; segmentation; negative-sense; phylogeny; viruses
19.  Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases 
Background
The eukaryotic RNA-dependent RNA polymerase (RDRP) is involved in the amplification of regulatory microRNAs during post-transcriptional gene silencing. This enzyme is highly conserved in most eukaryotes but is missing in archaea and bacteria. No evolutionary relationship between RDRP and other polymerases has been reported so far, hence the origin of this eukaryote-specific polymerase remains a mystery.
Results
Using extensive sequence profile searches, we identified bacteriophage homologs of the eukaryotic RDRP. The comparison of the eukaryotic RDRP and their homologs from bacteriophages led to the delineation of the conserved portion of these enzymes, which is predicted to harbor the catalytic site. Further, detailed sequence comparison, aided by examination of the crystal structure of the DNA-dependent RNA polymerase (DDRP), showed that the RDRP and the β' subunit of DDRP (and its orthologs in archaea and eukaryotes) contain a conserved double-psi β-barrel (DPBB) domain. This DPBB domain contains the signature motif DbDGD (b is a bulky residue), which is conserved in all RDRPs and DDRPs and contributes to catalysis via a coordinated divalent cation. Apart from the DPBB domain, no similarity was detected between RDRP and DDRP, which leaves open two scenarios for the origin of RDRP: i) RDRP evolved at the onset of the evolution of eukaryotes via a duplication of the DDRP β' subunit followed by dramatic divergence that obliterated the sequence similarity outside the core catalytic domain and ii) the primordial RDRP, which consisted primarily of the DPBB domain, evolved from a common ancestor with the DDRP at a very early stage of evolution, during the RNA world era. The latter hypothesis implies that RDRP had been subsequently eliminated from cellular life forms and might have been reintroduced into the eukaryotic genomes through a bacteriophage. Sequence and structure analysis of the DDRP led to further insights into the evolution of RNA polymerases. In addition to the β' subunit, β subunit of DDRP also contains a DPBB domain, which is, however, distorted by large inserts and does not harbor a counterpart of the DbDGD motif. The DPBB domains of the two DDRP subunits together form the catalytic cleft, with the domain from the β' subunit supplying the metal-coordinating DbDGD motif and the one from the β subunit providing two lysine residues involved in catalysis. Given that the two DPBB domains of DDRP contribute completely different sets of active residues to the catalytic center, it is hypothesized that the ultimate ancestor of RNA polymerases functioned as a homodimer of a generic, RNA-binding DPBB domain. This ancestral protein probably did not have catalytic activity and served as a cofactor for a ribozyme RNA polymerase. Subsequent evolution of DDRP and RDRP involved accretion of distinct sets of additional domains. In the DDRPs, these included a RNA-binding Zn-ribbon, an AT-hook-like module and a sandwich-barrel hybrid motif (SBHM) domain. Further, lineage-specific accretion of SBHM domains and other, DDRP-specific domains is observed in bacterial DDRPs. In contrast, the orthologs of the β' subunit in archaea and eukaryotes contains a four-stranded α + β domain that is shared with the α-subunit of bacterial DDRP, eukaryotic DDRP subunit RBP11, translation factor eIF1 and type II topoisomerases. The additional domains of the RDRPs remain to be characterized.
Conclusions
Eukaryotic RNA-dependent RNA polymerases share the catalytic double-psi β-barrel domain, containing a signature metal-coordinating motif, with the universally conserved β' subunit of DNA-dependent RNA polymerases. Beyond this core catalytic domain, the two classes of RNA polymerases do not have common domains, suggesting early divergence from a common ancestor, with subsequent independent domain accretion. The β-subunit of DDRP contains another, highly diverged DPBB domain. The presence of two distinct DPBB domains in two subunits of DDRP is compatible with the hypothesis that the ultimate ancestor of RNA polymerases was a RNA-binding DPBB domain that had no catalytic activity but rather functioned as a homodimeric cofactor for a ribozyme polymerase.
doi:10.1186/1472-6807-3-1
PMCID: PMC151600  PMID: 12553882
20.  Diverse CRISPRs Evolving in Human Microbiomes 
PLoS Genetics  2012;8(6):e1002441.
CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, together with cas (CRISPR–associated) genes, form the CRISPR/Cas adaptive immune system, a primary defense strategy that eubacteria and archaea mobilize against foreign nucleic acids, including phages and conjugative plasmids. Short spacer sequences separated by the repeats are derived from foreign DNA and direct interference to future infections. The availability of hundreds of shotgun metagenomic datasets from the Human Microbiome Project (HMP) enables us to explore the distribution and diversity of known CRISPRs in human-associated microbial communities and to discover new CRISPRs. We propose a targeted assembly strategy to reconstruct CRISPR arrays, which whole-metagenome assemblies fail to identify. For each known CRISPR type (identified from reference genomes), we use its direct repeat consensus sequence to recruit reads from each HMP dataset and then assemble the recruited reads into CRISPR loci; the unique spacer sequences can then be extracted for analysis. We also identified novel CRISPRs or new CRISPR variants in contigs from whole-metagenome assemblies and used targeted assembly to more comprehensively identify these CRISPRs across samples. We observed that the distributions of CRISPRs (including 64 known and 86 novel ones) are largely body-site specific. We provide detailed analysis of several CRISPR loci, including novel CRISPRs. For example, known streptococcal CRISPRs were identified in most oral microbiomes, totaling ∼8,000 unique spacers: samples resampled from the same individual and oral site shared the most spacers; different oral sites from the same individual shared significantly fewer, while different individuals had almost no common spacers, indicating the impact of subtle niche differences on the evolution of CRISPR defenses. We further demonstrate potential applications of CRISPRs to the tracing of rare species and the virus exposure of individuals. This work indicates the importance of effective identification and characterization of CRISPR loci to the study of the dynamic ecology of microbiomes.
Author Summary
Human bodies are complex ecological systems in which various microbial organisms and viruses interact with each other and with the human host. The Human Microbiome Project (HMP) has resulted in >700 datasets of shotgun metagenomic sequences, from which we can learn about the compositions and functions of human-associated microbial communities. CRISPR/Cas systems are a widespread class of adaptive immune systems in bacteria and archaea, providing acquired immunity against foreign nucleic acids: CRISPR/Cas defense pathways involve integration of viral- or plasmid-derived DNA segments into CRISPR arrays (forming spacers between repeated structural sequences), and expression of short crRNAs from these single repeat-spacer units, to generate interference to future invading foreign genomes. Powered by an effective computational approach (the targeted assembly approach for CRISPR), our analysis of CRISPR arrays in the HMP datasets provides the very first global view of bacterial immunity systems in human-associated microbial communities. The great diversity of CRISPR spacers we observed among different body sites, in different individuals, and in single individuals over time, indicates the impact of subtle niche differences on the evolution of CRISPR defenses and indicates the key role of bacteriophage (and plasmids) in shaping human microbial communities.
doi:10.1371/journal.pgen.1002441
PMCID: PMC3374615  PMID: 22719260
21.  Relationships among the positive strand and double-strand RNA viruses as viewed through their RNA-dependent RNA polymerases. 
Nucleic Acids Research  1991;19(2):217-226.
The sequences of 50 RNA-dependent RNA polymerases (RDRPs) from 43 positive strand and 7 double strand RNA (dsRNA) viruses have been compared. The alignment permitted calculation of distances among the 50 viruses and a resultant dendrogram based on every amino acid, rather than just those amino acids in the conserved motifs. Remarkably, a large subgroup of these viruses, including vertebrate, plant, and insect viruses, forms a single cluster whose only common characteristic is exploitation of insect hosts or vectors. This similarity may be due to molecular constraints associated with a present and/or past ability to infect insects and/or to common descent from insect viruses. If common descent is important, as it appears to be, all the positive strand RNA viruses of eucaryotes except for the picornaviruses may have evolved from an ancestral dsRNA virus. Viral RDRPs appear to be inherited as modules rather than as portions of single RNA segments, implying that RNA recombination has played an important role in their dissemination.
PMCID: PMC333583  PMID: 2014162
22.  The Discovery, Distribution, and Evolution of Viruses Associated with Drosophila melanogaster 
PLoS Biology  2015;13(7):e1002210.
Drosophila melanogaster is a valuable invertebrate model for viral infection and antiviral immunity, and is a focus for studies of insect-virus coevolution. Here we use a metagenomic approach to identify more than 20 previously undetected RNA viruses and a DNA virus associated with wild D. melanogaster. These viruses not only include distant relatives of known insect pathogens but also novel groups of insect-infecting viruses. By sequencing virus-derived small RNAs, we show that the viruses represent active infections of Drosophila. We find that the RNA viruses differ in the number and properties of their small RNAs, and we detect both siRNAs and a novel miRNA from the DNA virus. Analysis of small RNAs also allows us to identify putative viral sequences that lack detectable sequence similarity to known viruses. By surveying >2,000 individually collected wild adult Drosophila we show that more than 30% of D. melanogaster carry a detectable virus, and more than 6% carry multiple viruses. However, despite a high prevalence of the Wolbachia endosymbiont—which is known to be protective against virus infections in Drosophila—we were unable to detect any relationship between the presence of Wolbachia and the presence of any virus. Using publicly available RNA-seq datasets, we show that the community of viruses in Drosophila laboratories is very different from that seen in the wild, but that some of the newly discovered viruses are nevertheless widespread in laboratory lines and are ubiquitous in cell culture. By sequencing viruses from individual wild-collected flies we show that some viruses are shared between D. melanogaster and D. simulans. Our results provide an essential evolutionary and ecological context for host–virus interaction in Drosophila, and the newly reported viral sequences will help develop D. melanogaster further as a model for molecular and evolutionary virus research.
Sequencing of metagenomic RNA and small RNA identifies more than 20 new viruses associated with the fruit fly Drosophila melanogaster, and large-scale surveys show that many are common in the lab and in the field.
Author Summary
The fruit fly Drosophila melanogaster is extensively used as a model species for molecular biology and genetics. It is also widely studied for its evolutionary history, helping us understand how natural selection has shaped the genome. Drosophila research has been particularly valuable in determining how the insect immune system interacts with viruses and how co-evolution between hosts and viruses can shape the host immune system. Understanding insect–virus coevolution is important because some viruses—such as those which cause dengue and yellow fever in humans—also infect their insect vectors, and because the viruses of bees and other pollinators are implicated in pollinator decline. Although we have an increasingly good idea of how flies recognise and combat viral pathogens, we still have much to learn about the viruses they encounter and interact with in the wild. In this paper, we sequence all of the genetic material from a large collection of wild fruit flies and use it to identify more than 20 new viruses. We then survey individual wild flies and laboratory stocks to find out which viruses are common, which are rare, and which species of fruit fly they infect. Our results provide valuable tools and an evolutionary and ecological perspective that will help to improve Drosophila as a model for host–virus biology in the future.
doi:10.1371/journal.pbio.1002210
PMCID: PMC4501690  PMID: 26172158
23.  Infectious Bronchitis Virus Generates Spherules from Zippered Endoplasmic Reticulum Membranes 
mBio  2013;4(5):e00801-13.
ABSTRACT
Replication of positive-sense RNA viruses is associated with the rearrangement of cellular membranes. Previous work on the infection of tissue culture cell lines with the betacoronaviruses mouse hepatitis virus and severe acute respiratory syndrome coronavirus (SARS-CoV) showed that they generate double-membrane vesicles (DMVs) and convoluted membranes as part of a reticular membrane network. Here we describe a detailed study of the membrane rearrangements induced by the avian gammacoronavirus infectious bronchitis virus (IBV) in a mammalian cell line but also in primary avian cells and in epithelial cells of ex vivo tracheal organ cultures. In all cell types, structures novel to IBV infection were identified that we have termed zippered endoplasmic reticulum (ER) and spherules. Zippered ER lacked luminal space, suggesting zippering of ER cisternae, while spherules appeared as uniform invaginations of zippered ER. Electron tomography showed that IBV-induced spherules are tethered to the zippered ER and that there is a channel connecting the interior of the spherule with the cytoplasm, a feature thought to be necessary for sites of RNA synthesis but not seen previously for membrane rearrangements induced by coronaviruses. We also identified DMVs in IBV-infected cells that were observed as single individual DMVs or were connected to the ER via their outer membrane but not to the zippered ER. Interestingly, IBV-induced spherules strongly resemble confirmed sites of RNA synthesis for alphaviruses, nodaviruses, and bromoviruses, which may indicate similar strategies of IBV and these diverse viruses for the assembly of RNA replication complexes.
IMPORTANCE All positive-sense single-stranded RNA viruses induce rearranged cellular membranes, providing a platform for viral replication complex assembly and protecting viral RNA from cellular defenses. We have studied the membrane rearrangements induced by an important poultry pathogen, the gammacoronavirus infectious bronchitis virus (IBV). Previous work studying closely related betacoronaviruses identified double-membrane vesicles (DMVs) and convoluted membranes (CMs) derived from the endoplasmic reticulum (ER) in infected cells. However, the role of DMVs and CMs in viral RNA synthesis remains unclear because these sealed vesicles lack a means of delivering viral RNA to the cytoplasm. Here, we characterized structures novel to IBV infection: zippered ER and small vesicles tethered to the zippered ER termed spherules. Significantly, spherules contain a channel connecting their interior to the cytoplasm and strongly resemble confirmed sites of RNA synthesis for other positive-sense RNA viruses, making them ideal candidates for the site of IBV RNA synthesis.
IMPORTANCE
All positive-sense single-stranded RNA viruses induce rearranged cellular membranes, providing a platform for viral replication complex assembly and protecting viral RNA from cellular defenses. We have studied the membrane rearrangements induced by an important poultry pathogen, the gammacoronavirus infectious bronchitis virus (IBV). Previous work studying closely related betacoronaviruses identified double-membrane vesicles (DMVs) and convoluted membranes (CMs) derived from the endoplasmic reticulum (ER) in infected cells. However, the role of DMVs and CMs in viral RNA synthesis remains unclear because these sealed vesicles lack a means of delivering viral RNA to the cytoplasm. Here, we characterized structures novel to IBV infection: zippered ER and small vesicles tethered to the zippered ER termed spherules. Significantly, spherules contain a channel connecting their interior to the cytoplasm and strongly resemble confirmed sites of RNA synthesis for other positive-sense RNA viruses, making them ideal candidates for the site of IBV RNA synthesis.
doi:10.1128/mBio.00801-13
PMCID: PMC3812713  PMID: 24149513
24.  Novel Division Level Bacterial Diversity in a Yellowstone Hot Spring 
Journal of Bacteriology  1998;180(2):366-376.
A culture-independent molecular phylogenetic survey was carried out for the bacterial community in Obsidian Pool (OP), a Yellowstone National Park hot spring previously shown to contain remarkable archaeal diversity (S. M. Barns, R. E. Fundyga, M. W. Jeffries, and N. R. Page, Proc. Natl. Acad. Sci. USA 91:1609–1613, 1994). Small-subunit rRNA genes (rDNA) were amplified directly from OP sediment DNA by PCR with universally conserved or Bacteria-specific rDNA primers and cloned. Unique rDNA types among >300 clones were identified by restriction fragment length polymorphism, and 122 representative rDNA sequences were determined. These were found to represent 54 distinct bacterial sequence types or clusters (≥98% identity) of sequences. A majority (70%) of the sequence types were affiliated with 14 previously recognized bacterial divisions (main phyla; kingdoms); 30% were unaffiliated with recognized bacterial divisions. The unaffiliated sequence types (represented by 38 sequences) nominally comprise 12 novel, division level lineages termed candidate divisions. Several OP sequences were nearly identical to those of cultivated chemolithotrophic thermophiles, including the hydrogen-oxidizing Calderobacterium and the sulfate reducers Thermodesulfovibrio and Thermodesulfobacterium, or belonged to monophyletic assemblages recognized for a particular type of metabolism, such as the hydrogen-oxidizing Aquificales and the sulfate-reducing δ-Proteobacteria. The occurrence of such organisms is consistent with the chemical composition of OP (high in reduced iron and sulfur) and suggests a lithotrophic base for primary productivity in this hot spring, through hydrogen oxidation and sulfate reduction. Unexpectedly, no archaeal sequences were encountered in OP clone libraries made with universal primers. Hybridization analysis of amplified OP DNA with domain-specific probes confirmed that the analyzed community rDNA from OP sediment was predominantly bacterial. These results expand substantially our knowledge of the extent of bacterial diversity and call into question the commonly held notion that Archaea dominate hydrothermal environments. Finally, the currently known extent of division level bacterial phylogenetic diversity is collated and summarized.
PMCID: PMC106892  PMID: 9440526
25.  Avian reovirus L2 genome segment sequences and predicted structure/function of the encoded RNA-dependent RNA polymerase protein 
Virology Journal  2008;5:153.
Background
The orthoreoviruses are infectious agents that possess a genome comprised of 10 double-stranded RNA segments encased in two concentric protein capsids. Like virtually all RNA viruses, an RNA-dependent RNA polymerase (RdRp) enzyme is required for viral propagation. RdRp sequences have been determined for the prototype mammalian orthoreoviruses and for several other closely-related reoviruses, including aquareoviruses, but have not yet been reported for any avian orthoreoviruses.
Results
We determined the L2 genome segment nucleotide sequences, which encode the RdRp proteins, of two different avian reoviruses, strains ARV138 and ARV176 in order to define conserved and variable regions within reovirus RdRp proteins and to better delineate structure/function of this important enzyme. The ARV138 L2 genome segment was 3829 base pairs long, whereas the ARV176 L2 segment was 3830 nucleotides long. Both segments were predicted to encode λB RdRp proteins 1259 amino acids in length. Alignments of these newly-determined ARV genome segments, and their corresponding proteins, were performed with all currently available homologous mammalian reovirus (MRV) and aquareovirus (AqRV) genome segment and protein sequences. There was ~55% amino acid identity between ARV λB and MRV λ3 proteins, making the RdRp protein the most highly conserved of currently known orthoreovirus proteins, and there was ~28% identity between ARV λB and homologous MRV and AqRV RdRp proteins. Predictive structure/function mapping of identical and conserved residues within the known MRV λ3 atomic structure indicated most identical amino acids and conservative substitutions were located near and within predicted catalytic domains and lining RdRp channels, whereas non-identical amino acids were generally located on the molecule's surfaces.
Conclusion
The ARV λB and MRV λ3 proteins showed the highest ARV:MRV identity values (~55%) amongst all currently known ARV and MRV proteins. This implies significant evolutionary constraints are placed on dsRNA RdRp molecules, particularly in regions comprising the canonical polymerase motifs and residues thought to interact directly with template and nascent mRNA. This may point the way to improved design of anti-viral agents specifically targeting this enzyme.
doi:10.1186/1743-422X-5-153
PMCID: PMC2615760  PMID: 19091125

Results 1-25 (1356060)