Understanding the biological roles of microRNAs (miRNAs) is a an active area of research that has produced a surge of publications in PubMed, particularly in cancer research. Along with this increasing interest, many open-source bioinformatics tools to identify existing and/or discover novel miRNAs in next-generation sequencing (NGS) reads become available. While miRNA identification and discovery tools are significantly improved, the development of miRNA differential expression analysis tools, especially in temporal studies, remains substantially challenging. Further, the installation of currently available software is non-trivial and steps of testing with example datasets, trying with one’s own dataset, and interpreting the results require notable expertise and time. Subsequently, there is a strong need for a tool that allows scientists to normalize raw data, perform statistical analyses, and provide intuitive results without having to invest significant efforts.
We have developed miRNA Temporal Analyzer (mirnaTA), a bioinformatics package to identify differentially expressed miRNAs in temporal studies. mirnaTA is written in Perl and R (Version 2.13.0 or later) and can be run across multiple platforms, such as Linux, Mac and Windows. In the current version, mirnaTA requires users to provide a simple, tab-delimited, matrix file containing miRNA name and count data from a minimum of two to a maximum of 20 time points and three replicates. To recalibrate data and remove technical variability, raw data is normalized using Normal Quantile Transformation (NQT), and linear regression model is used to locate any miRNAs which are differentially expressed in a linear pattern. Subsequently, remaining miRNAs which do not fit a linear model are further analyzed in two different non-linear methods 1) cumulative distribution function (CDF) or 2) analysis of variances (ANOVA). After both linear and non-linear analyses are completed, statistically significant miRNAs (P < 0.05) are plotted as heat maps using hierarchical cluster analysis and Euclidean distance matrix computation methods.
mirnaTA is an open-source, bioinformatics tool to aid scientists in identifying differentially expressed miRNAs which could be further mined for biological significance. It is expected to provide researchers with a means of interpreting raw data to statistical summaries in a fast and intuitive manner.
microRNA; miRNA Temporal Analyzer; mirnaTA; Time series; Differential expression; DE; Quantile normalization; Linear model; Normal quantile transformation
We present draft genome sequences of 15 clinical Vibrio isolates of various serogroups. These are valuable data for use in studying Vibrio cholerae genetic diversity, epidemic potential, and strain attribution.
In order to better characterize the Bacillus anthracis typing phage AP50c, we designed a genetic screen to identify its bacterial receptor. Insertions of the transposon mariner or targeted deletions of the structural gene for the S-layer protein Sap and the sporulation genes spo0A, spo0B, and spo0F in B. anthracis Sterne resulted in phage resistance with concomitant defects in phage adsorption and infectivity. Electron microscopy of bacteria incubated with AP50c revealed phage particles associated with the surface of bacilli of the Sterne strain but not with the surfaces of Δsap, Δspo0A, Δspo0B, or Δspo0F mutants. The amount of Sap in the S layer of each of the spo0 mutant strains was substantially reduced compared to that of the parent strain, and incubation of AP50c with purified recombinant Sap led to a substantial reduction in phage activity. Phylogenetic analysis based on whole-genome sequences of B. cereus sensu lato strains revealed several closely related B. cereus and B. thuringiensis strains that carry sap genes with very high similarities to the sap gene of B. anthracis. Complementation of the Δsap mutant in trans with the wild-type B. anthracis
sap or the sap gene from either of two different B. cereus strains that are sensitive to AP50c infection restored phage sensitivity, and electron microscopy confirmed attachment of phage particles to the surface of each of the complemented strains. Based on these data, we postulate that Sap is involved in AP50c infectivity, most likely acting as the phage receptor, and that the spo0 genes may regulate synthesis of Sap and/or formation of the S layer.
Middle East respiratory syndrome coronavirus (MERS-CoV) is the etiologic agent of a highly lethal pneumonia. Here, we report the full-genome sequence of the Jordan-N3/2012 strain after serial passage in two distinct mammalian cell lines. The genome exhibits noteworthy stability, which may inform the development of vaccines and therapeutics used to treat infection with this virus.
The development of Next Generation Sequencing (NGS) during the last decade has created an unprecedented amount of sequencing data, as well as the ability to rapidly sequence specimens of interest. Read-based BLAST analysis of NGS data is a common procedure especially in the case of metagenomic samples. However, coverage is usually not enough to allow for de novo assembly. This type of read-based analysis often creates the question of how the reads that align to the same sequence are distributed. The same question applies to preparation of primers or probes for microarray experiments. Although there are several packages that allow the visualization of DNA segments in relation to a reference, in most cases they require the visualization of one reference at a time and the capture of screen shots for each segment. Such a procedure could be tedious and time consuming. The field is in need of a solution that automates the capture of coverage plots for all the segments of interest.
We have created BLASTPLOT, a PERL module to quickly plot the BLAST results from short sequences (primers, probes, reads) against reference targets.
BLASTPLOT is a simple to use PERL module that allows the generation of PNG graphs for all the reference sequences associated with a BLAST result set.
BLAST; Plot; PERL; Package; Graph; Primer; Probe; NGS; Reads; Sequencing
The introduction of benchtop sequencers has made adoption of whole genome sequencing possible for a broader community of researchers than ever before. Concurrently, metagenomic sequencing (MGS) is rapidly emerging as a tool for interrogating complex samples that defy conventional analyses. In addition, next-generation sequencers are increasingly being used in clinical or related settings, for instance to track outbreaks. However, information regarding the analytical sensitivity or limit of detection (LoD) of benchtop sequencers is currently lacking. Furthermore, the specificity of sequence information at or near the LoD is unknown.
In the present study, we assess the ability of three next-generation sequencing platforms to identify a pathogen (viral or bacterial) present in low titers in a clinically relevant sample (blood). Our results indicate that the Roche-454 Titanium platform is capable of detecting Dengue virus at titers as low as 1X102.5 pfu/mL, corresponding to an estimated 5.4X104 genome copies/ml maximum. The increased throughput of the benchtop sequencers, the Ion Torrent PGM and Illumina MiSeq platforms, enabled detection of viral genomes at concentrations as low as 1X104 genome copies/mL. Platform-specific biases were evident in sequence read distributions as well as viral genome coverage. For bacterial samples, only the MiSeq platform was able to provide sequencing reads that could be unambiguously classified as originating from Bacillus anthracis.
The analytical sensitivity of all three platforms approaches that of standard qPCR assays. Although all platforms were able to detect pathogens at the levels tested, there were several noteworthy differences. The Roche-454 Titanium platform produced consistently longer reads, even when compared with the latest chemistry updates for the PGM platform. The MiSeq platform produced consistently greater depth and breadth of coverage, while the Ion Torrent was unequaled for speed of sequencing. None of the platforms were able to verify a single nucleotide polymorphism responsible for antiviral resistance in an Influenza A strain isolated from the 2009 H1N1 pandemic. Overall, the benchtop platforms perform well for identification of pathogens from a representative clinical sample. However, unlike identification, characterization of pathogens is likely to require higher titers, multiple libraries and/or multiple sequencing runs.
The henipaviruses, represented by Hendra (HeV) and Nipah (NiV) viruses are highly pathogenic zoonotic paramyxoviruses with uniquely broad host tropisms responsible for repeated outbreaks in Australia, Southeast Asia, India and Bangladesh. The high morbidity and mortality rates associated with infection and lack of licensed antiviral therapies make the henipaviruses a potential biological threat to humans and livestock. Henipavirus entry is initiated by the attachment of the G envelope glycoprotein to host cell membrane receptors. Previously, henipavirus-neutralizing human monoclonal antibodies (hmAb) have been isolated using the HeV-G glycoprotein and a human naïve antibody library. One cross-reactive and receptor-blocking hmAb (m102.4) was recently demonstrated to be an effective post-exposure therapy in two animal models of NiV and HeV infection, has been used in several people on a compassionate use basis, and is currently in development for use in humans. Here, we report the crystal structure of the complex of HeV-G with m102.3, an m102.4 derivative, and describe NiV and HeV escape mutants. This structure provides detailed insight into the mechanism of HeV and NiV neutralization by m102.4, and serves as a blueprint for further optimization of m102.4 as a therapeutic agent and for the development of entry inhibitors and vaccines.
Since their initial emergence, henipaviruses have continued to cause spillover events in both human and livestock populations, posing significant biothreats. Currently there are no licensed or approved therapies for treatment of henipavirus infection and the human case mortality rates average >70%. We used X-ray crystallography to determine the high-resolution structures of the Hendra virus G glycoprotein in complex with a cross-reactive neutralizing human monoclonal antibody. The structures provide detailed insight into the mechanism of HeV and NiV neutralization by this potent and clinically-relevant human monoclonal antibody that is currently in development for use in humans. This monoclonal antibody was recently shown to be an effective post-exposure therapy in non-human models of lethal Hendra virus infection. Indeed, it has already been used in four people on a compassionate use request, three in Australia and one in the United States, as a therapeutic agent. Furthermore, we identified and characterized two escape mutants generated in vitro and evaluated their mechanism of escape. Our results serve as a blueprint for further optimization of this antibody and for the development of novel entry inhibitors and vaccines. This report also supports the additional pre-clinical data required for eventual licensure by detailing the antibody's mechanism of henipavirus neutralization.
Rickettsia prowazekii is a notable intracellular pathogen, the agent of epidemic typhus, and a potential biothreat agent. We present here whole-genome sequence data for four strains of R. prowazekii, including one from a flying squirrel.
Vibrio cholerae O139 Bengal is the only serogroup other than O1 implicated in cholera epidemics. We describe the isolation and characterization of an O139 serogroup-specific phage, vB_VchP_VchO139-I (ϕVchO139-I) that has similar host range and virion morphology as phage vB_VchP_JA1 (ϕJA1) described previously. We aimed at a complete molecular characterization of both phages and elucidation of their genetic and structural differences and assessment of their genetic relatedness to the N4-like phage group.
Host-range analysis and plaque morphology screening were done for both ϕJA1 and ϕVchO139-I. Both phage genomes were sequenced by a 454 and Sanger hybrid approach. Genomes were annotated and protein homologies were determined by Blast and HHPred. Restriction profiles, PFGE patterns and data on the physical genome structure were acquired and phylogenetic analyses were performed.
The host specificity of ϕJA1 has been attributed to the unique capsular O-antigen produced by O139 strains. Plaque morphologies of the two phages were different; ϕVchO139-I produced a larger halo around the plaques than ϕJA1. Restriction profiles of ϕJA1 and ϕVchO139-I genomes were also different. The genomes of ϕJA1 and ϕVchO139-I consisted of linear double-stranded DNA of 71,252 and 70,938 base pairs. The presence of direct terminal repeats of around 1974 base pairs was demonstrated. Whole genome comparison revealed single nucleotide polymorphisms, small insertions/deletions and differences in gene content. Both genomes had 79 predicted protein encoding sequences, of which only 59 were identical between the two closely related phages. They also encoded one tRNA-Arg gene, an intein within the large terminase gene, and four homing endonuclease genes. Whole genome phylogenetic analyses of ϕJA1 and ϕVchO139-I against other sequenced N4-like phages delineate three novel subgroups or clades within this phage family.
The closely related phages feature significant genetic differences, in spite of being morphologically identical. The phage morphology, genetic organization, genomic content and large terminase protein based phylogeny support the placement of these two phages in the Podoviridae family, more specifically within the N4-like phage group. The physical genome structure of ϕJA1 could be demonstrated experimentally. Our data pave the way for potential use of ϕJA1 and ϕVchO139-I in Vibrio cholerae typing and control.
Vibrio cholerae O139; N4-like virus; Genome comparison; Terminal repeats; Intein; Phylogenetic relationship
Clostridium difficile causes one of the leading nosocomial infections in developed countries, and therapeutic choices are limited. Some strains of C. difficile produce phage tail-like particles upon induction of the SOS response. These particles have bactericidal activity against other C. difficile strains and can therefore be classified as bacteriocins, similar to the R-type pyocins of Pseudomonas aeruginosa. These R-type bacteriocin particles, which have been purified from different strains, each have a different C. difficile-killing spectrum, with no one bacteriocin killing all C. difficile isolates tested. We have identified the genetic locus of these “diffocins” (open reading frames 1359 to 1376) and have found them to be common among the species. The entire diffocin genetic locus of more than 20 kb was cloned and expressed in Bacillus subtilis, and this resulted in production of bactericidal particles. One of the interesting features of these particles is a very large structural protein of ∼200 kDa, the product of gene 1374. This large protein determines the killing spectrum of the particles and is likely the receptor-binding protein. Diffocins may provide an alternate bactericidal agent to prevent or treat infections and to decolonize individuals who are asymptomatic carriers.
In May of 2011, an enteroaggregative Escherichia coli O104:H4 strain that had acquired a Shiga toxin 2-converting phage caused a large outbreak of bloody diarrhea in Europe which was notable for its high prevalence of hemolytic uremic syndrome cases. Several studies have described the genomic inventory and phylogenies of strains associated with the outbreak and a collection of historical E. coli O104:H4 isolates using draft genome assemblies. We present the complete, closed genome sequences of an isolate from the 2011 outbreak (2011C–3493) and two isolates from cases of bloody diarrhea that occurred in the Republic of Georgia in 2009 (2009EL–2050 and 2009EL–2071). Comparative genome analysis indicates that, while the Georgian strains are the nearest neighbors to the 2011 outbreak isolates sequenced to date, structural and nucleotide-level differences are evident in the Stx2 phage genomes, the mer/tet antibiotic resistance island, and in the prophage and plasmid profiles of the strains, including a previously undescribed plasmid with homology to the pMT virulence plasmid of Yersinia pestis. In addition, multiphenotype analysis showed that 2009EL–2071 possessed higher resistance to polymyxin and membrane-disrupting agents. Finally, we show evidence by electron microscopy of the presence of a common phage morphotype among the European and Georgian strains and a second phage morphotype among the Georgian strains. The presence of at least two stx2 phage genotypes in host genetic backgrounds that may derive from a recent common ancestor of the 2011 outbreak isolates indicates that the emergence of stx2 phage-containing E. coli O104:H4 strains probably occurred more than once, or that the current outbreak isolates may be the result of a recent transfer of a new stx2 phage element into a pre-existing stx2-positive genetic background.
Spontaneous Bacillus anthracis mutants resistant to infection by phage AP50c (AP50R) exhibit a mucoid colony phenotype and secrete an extracellular matrix.
Here we utilized a Roche/454-based whole genome sequencing approach to identify mutations that are candidates for conferring AP50c phage resistance, followed by genetic deletion and complementation studies to validate the whole genome sequence data and demonstrate that the implicated gene is necessary for AP50c phage infection.
Using whole genome sequence data, we mapped the relevant mutations in six AP50R strains to csaB. Eleven additional spontaneous mutants, isolated in two different genetic backgrounds, were screened by PCR followed by Sanger sequencing of the csaB gene. In each spontaneous mutant, we found either a non-synonymous substitution, a nonsense mutation, or a frame-shift mutation caused by single nucleotide polymorphisms or a 5 base pair insertion in csaB. All together, 5 and 12 of the 17 spontaneous mutations are predicted to yield altered full length and truncated CsaB proteins respectively. As expected from these results, a targeted deletion or frame-shift mutations introduced into csaB in a different genetic background, in a strain not exposed to AP50c, resulted in a phage resistant phenotype. Also, substitution of a highly conserved histidine residue with an alanine residue (H270A) in CsaB resulted in phage resistance, suggesting that a functional CsaB is necessary for phage sensitivity. Conversely, introduction of the wild type allele of csaB in cis into the csaB deletion mutant by homologous recombination or supplying the wild type CsaB protein in trans from a plasmid restored phage sensitivity. The csaB mutants accumulated cell wall material and appeared to have a defective S-layer, whereas these phenotypes were reverted in the complemented strains.
Taken together, these data suggest an essential role for csaB in AP50c phage infection, most likely in phage adsorption. (The whole genome sequences generated from this study have been submitted to GenBank under SRA project ID: SRA023659.1 and sample IDs: AP50 R1: SRS113675.1, AP50 R2: SRS113676.1, AP50 R3: SRS113728.1, AP50 R4: SRS113733.1, AP50 R6: SRS113734.1, JB220 Parent: SRS150209.1, JB220 Mutant: SRS150211.1).
Phage resistance; WGS; Mutation mapping; SNP; B. anthracis
The conventional and most accepted method of measuring the lytic activity of a phage against its bacterial host is the plaque assay. This method is laborious, time consuming and expensive, especially in high throughput analyses where multiple phage-bacterial interactions are required to be monitored simultaneously. It can also vary considerably with the experimenter and by the growth and plating conditions. Alternatively, the lytic activity can be measured indirectly by following the decrease in optical density of the bacterial cultures owing to lysis. Here we describe an automated, high throughput, indirect liquid lysis assay to evaluate phage growth using the OmniLogTM system. The OmniLogTM system uses redox chemistry, employing cell respiration as a universal reporter. During active growth of bacteria, cellular respiration reduces a tetrazolium dye and produces a color change that is measured in an automated fashion. On the other hand, successful phage infection and subsequent growth of the phage in its host bacterium results in reduced bacterial growth and respiration and a concomitant reduction in color. Here we show that microtiter plate wells inoculated with Bacillus anthracis and phage show decreased or no growth, compared with the wells containing bacteria only or phage resistant bacteria plus phage. Also, we show differences in the kinetics of bacterial growth and the timing of appearance of phage resistant bacteria in the presence of individual phages or a cocktail of B. anthracis specific phages. The results of these experiments indicate that the OmniLogTM system could be used reliably for indirectly measuring phage growth in high throughput host range and phage and antibiotics combination studies.
Bacillus anthracis; OmniLogTM; bacteriophage; in vitro lytic assay; phage
Five Y. pestis bacteriophages obtained from various sources were characterized to determine their biological properties, including their taxonomic classification, host range and genomic diversity. Four of the phages (YpP-G, Y, R and YpsP-G) belong to the Podoviridae family, and the fifth phage (YpsP-PST) belongs to the Myoviridae family, of the order Caudovirales comprising of double-stranded DNA phages. The genomes of the four Podoviridae phages were fully sequenced and found to be almost identical to each other and to those of two previously characterized Y. pestis phages Yepe2 and φA1122. However, despite their genomic homogeneity, they varied in their ability to lyse Y. pestis and Y. pseudotuberculosis strains. The five phages were combined to yield a “phage cocktail” (tentatively designated “YPP-100”) capable of lysing the 59 Y. pestis strains in our collection. YPP-100 was examined for its ability to decontaminate three different hard surfaces (glass, gypsum board and stainless steel) experimentally contaminated with a mixture of three genetically diverse Y. pestis strains CO92, KIM and 1670G. Five minutes of exposure to YPP-100 preparations containing phage concentrations of ca. 109, 108 and 107 PFU/mL completely eliminated all viable Y. pestis cells from all three surfaces, but a few viable cells were recovered from the stainless steel coupons treated with YPP-100 diluted to contain ca. 106 PFU/mL. However, even that highly diluted preparation significantly (p = < 0.05) reduced Y. pestis levels by ≥ 99.97%. Our data support the idea that Y. pestis phages may be useful for decontaminating various hard surfaces naturally- or intentionally-contaminated with Y. pestis.
bacteriophage; phage; Yersinia pestis; surface decontamination
OmniLog™ phenotype microarrays (PMs) have the capability to measure and compare the growth responses of biological samples upon exposure to hundreds of growth conditions such as different metabolites and antibiotics over a time course of hours to days. In order to manage the large amount of data produced from the OmniLog™ instrument, PheMaDB (Phenotype Microarray DataBase), a web-based relational database, was designed. PheMaDB enables efficient storage, retrieval and rapid analysis of the OmniLog™ PM data.
PheMaDB allows the user to quickly identify records of interest for data analysis by filtering with a hierarchical ordering of Project, Strain, Phenotype, Replicate, and Temperature. PheMaDB then provides various statistical analysis options to identify specific growth pattern characteristics of the experimental strains, such as: outlier analysis, negative controls analysis (signal/background calibration), bar plots, pearson's correlation matrix, growth curve profile search, k-means clustering, and a heat map plot. This web-based database management system allows for both easy data sharing among multiple users and robust tools to phenotype organisms of interest.
PheMaDB is an open source system standardized for OmniLog™ PM data. PheMaDB could facilitate the banking and sharing of phenotype data. The source code is available for download at http://phemadb.sourceforge.net.
Despite the decades-long use of Bacillus atrophaeus var. globigii (BG) as a simulant for biological warfare (BW) agents, knowledge of its genome composition is limited. Furthermore, the ability to differentiate signatures of deliberate adaptation and selection from natural variation is lacking for most bacterial agents. We characterized a lineage of BGwith a long history of use as a simulant for BW operations, focusing on classical bacteriological markers, metabolic profiling and whole-genome shotgun sequencing (WGS).
Archival strains and two “present day” type strains were compared to simulant strains on different laboratory media. Several of the samples produced multiple colony morphotypes that differed from that of an archival isolate. To trace the microevolutionary history of these isolates, we obtained WGS data for several archival and present-day strains and morphotypes. Bacillus-wide phylogenetic analysis identified B. subtilis as the nearest neighbor to B. atrophaeus. The genome of B. atrophaeus is, on average, 86% identical to B. subtilis on the nucleotide level. WGS of variants revealed that several strains were mixed but highly related populations and uncovered a progressive accumulation of mutations among the “military” isolates. Metabolic profiling and microscopic examination of bacterial cultures revealed enhanced growth of “military” isolates on lactate-containing media, and showed that the “military” strains exhibited a hypersporulating phenotype.
Our analysis revealed the genomic and phenotypic signatures of strain adaptation and deliberate selection for traits that were desirable in a simulant organism. Together, these results demonstrate the power of whole-genome and modern systems-level approaches to characterize microbial lineages to develop and validate forensic markers for strain discrimination and reveal signatures of deliberate adaptation.
Complete sequences of 9.5-kb pPCP1 plasmids in three Yersinia pestis strains from the former Soviet Union (FSU) were determined and compared with those of pPCP1 plasmids in three well-characterized, non-FSU Y. pestis strains (KIM, CO92, and 91001). Two of the FSU plasmids were from strains C2614 and C2944, isolated from plague foci in Russia, and one plasmid was from strain C790 from Kyrgyzstan. Sequence analyses identified four sequence types among the six plasmids. The pPCP1 plasmids in the FSU strains were most genetically related to the pPCP1 plasmid in the KIM strain and least related to the pPCP1 plasmid in Y. pestis 91001. The FSU strains generally had larger pPCP1 plasmid copy numbers compared to strain CO92. Expression of the plasmid's pla gene was significantly (P ≤ .05) higher in strain C2944 than in strain CO92. Given pla's role in Y. pestis virulence, this difference may have important implications for the strain's virulence.
Despite the global threat caused by arthropod-borne viruses, there is not an efficient method for screening vector populations to detect novel viral sequences. Current viral detection and surveillance methods based on culture can be costly and time consuming and are predicated on prior knowledge of the etiologic agent, as they rely on specific oligonucleotide primers or antibodies. Therefore, these techniques may be unsuitable for situations when the causative agent of an outbreak is unknown.
In this study we explored the use of high-throughput pyrosequencing for surveillance of arthropod-borne RNA viruses. Dengue virus, a member of the positive strand RNA Flavivirus family that is transmitted by several members of the Aedes genus of mosquitoes, was used as a model. Aedes aegypti mosquitoes experimentally infected with dengue virus type 1 (DENV-1) were pooled with noninfected mosquitoes to simulate samples derived from ongoing arbovirus surveillance programs. Using random-primed methods, total RNA was reverse-transcribed and resulting cDNA subjected to 454 pyrosequencing.
In two types of samples, one with 5 adult mosquitoes infected with DENV-1- and the other with 1 DENV-1 infected mosquito and 4 noninfected mosquitoes, we identified DENV-1 DNA sequences. DENV-1 sequences were not detected in an uninfected control pool of 5 adult mosquitoes. We calculated the proportion of the Ae. aegypti metagenome contributed by each infecting Dengue virus genome (pIP), which ranged from 2.75×10−8 to 1.08×10−7. DENV-1 RNA was sufficiently concentrated in the mosquito that its detection was feasible using current high-throughput sequencing instrumentation. We also identified some of the components of the mosquito microflora on the basis of the sequence of expressed RNA. This included members of the bacterial genera Pirellula and Asaia, various fungi, and a potentially uncharacterized mycovirus.
Traditional methods for virus detection often rely on specific attributes, such as DNA sequences, of the viruses and therefore they not only require a priori knowledge of the agent in question, but they also are generally very specific in nature, capable of detecting viruses only from within a specific family, for example. Nextgen sequencing shows much promise for detection/diagnostic applications because of its ever-increasing throughput, decreasing cost, and unbiased nature. We investigated the applicability of 454 pyrosequencing for viral surveillance of insect populations, using Aedes aegypti mosquitoes experimentally inoculated with Dengue virus type 1 (DENV-1) and calculated what proportion of the total nucleic acid from crushed mosquitoes was contributed by the virus. We concluded that 454 pyrosequencing is capable of detecting even very small amounts of a known virus from within a pool of infected and noninfected mosquitoes, but for the amount of sequencing reads required to detect the virus, this technique may currently be too cost-prohibitive for use in large-scale surveillance efforts. Interesting byproducts of our study included a glimpse into what symbiotic organisms Ae. aegypti may harbor, as well as what genes may be differentially expressed in a DENV-1-infected versus noninfected mosquito.
The anthrax letter attacks of 2001 highlighted the need for rapid identification of biothreat agents not only for epidemiological surveillance of the intentional outbreak but also for implementing appropriate countermeasures, such as antibiotic treatment, in a timely manner to prevent further casualties. It is clear from the 2001 cases that survival may be markedly improved by administration of antimicrobial therapy during the early symptomatic phase of the illness; i.e., within 3 days of appearance of symptoms. Microbiological detection methods are feasible only for organisms that can be cultured in vitro and cannot detect all genetic modifications with the exception of antibiotic resistance. Currently available immuno or nucleic acid-based rapid detection assays utilize known, organism-specific proteins or genomic DNA signatures respectively. Hence, these assays lack the ability to detect novel natural variations or intentional genetic modifications that circumvent the targets of the detection assays or in the case of a biological attack using an antibiotic resistant or virulence enhanced Bacillus anthracis, to advise on therapeutic treatments.
We show here that the Roche 454-based pyrosequencing can generate whole genome draft sequences of deep and broad enough coverage of a bacterial genome in less than 24 hours. Furthermore, using the unfinished draft sequences, we demonstrate that unbiased identification of known as well as heretofore-unreported genetic modifications that include indels and single nucleotide polymorphisms conferring antibiotic and phage resistances is feasible within the next 12 hours.
Second generation sequencing technologies have paved the way for sequence-based rapid identification of both known and previously undocumented genetic modifications in cultured, conventional and newly emerging biothreat agents. Our findings have significant implications in the context of whole genome sequencing-based routine clinical diagnostics as well as epidemiological surveillance of natural disease outbreaks caused by bacterial and viral agents.
While new sequencing technologies have ushered in an era where microbial genomes can be easily sequenced, the goal of routinely producing high-quality draft and finished genomes in a cost-effective fashion has still remained elusive. Due to shorter read lengths and limitations in library construction protocols, shotgun sequencing and assembly based on these technologies often results in fragmented assemblies. Correspondingly, while draft assemblies can be obtained in days, finishing can take many months and hence the time and effort can only be justified for high-priority genomes and in large sequencing centers. In this work, we revisit this issue in light of our own experience in producing finished and nearly-finished genomes for a range of microbial species in a small-lab setting. These genomes were finished with surprisingly little investments in terms of time, computational effort and lab work, suggesting that the increased access to sequencing might also eventually lead to a greater proportion of finished genomes from small labs and genomics cores.