The last decade has seen a staggering transformation in our knowledge of microbial communities. Here, seven short pieces speculate as to what the next ten years might hold in store.
The development of high-throughput sequencing technologies has transformed our capacity to investigate the composition and dynamics of the microbial communities that populate diverse habitats. Over the past decade, these advances have yielded an avalanche of metagenomic data. The current stage of “van Leeuwenhoek”–like cataloguing, as well as functional analyses, will likely accelerate as DNA and RNA sequencing, plus protein and metabolic profiling capacities and computational tools, continue to improve. However, it is time to consider: what’s next for microbiome research? The short pieces included here briefly consider the challenges and opportunities awaiting microbiome research.
Human papillomavirus (HPV) causes a number of neoplastic diseases in humans. Here, we show a complex normal HPV community in a cohort of 103 healthy human subjects, by metagenomics analysis of the shotgun sequencing data generated from the NIH Human Microbiome Project. The overall HPV prevalence was 68.9% and was highest in the skin (61.3%), followed by the vagina (41.5%), mouth (30%), and gut (17.3%). Of the 109 HPV types as well as additional unclassified types detected, most were undetectable by the widely used commercial kits targeting the vaginal/cervical HPV types. These HPVs likely represent true HPV infections rather than transitory exposure because of strong organ tropism and persistence of the same HPV types in repeat samples. Coexistence of multiple HPV types was found in 48.1% of the HPV-positive samples. Networking between HPV types, cooccurrence or exclusion, was detected in vaginal and skin samples. Large contigs assembled from short HPV reads were obtained from several samples, confirming their genuine HPV origin. This first large-scale survey of HPV using a shotgun sequencing approach yielded a comprehensive map of HPV infections among different body sites of healthy human subjects.
IMPORTANCE This nonbiased survey indicates that the HPV community in healthy humans is much more complex than previously defined by widely used kits that are target selective for only a few high- and low-risk HPV types for cervical cancer. The importance of nononcogenic viruses in a mixed HPV infection could be for stimulating or inhibiting a coexisting oncogenic virus via viral interference or immune cross-reaction. Knowledge gained from this study will be helpful to guide the designing of epidemiological and clinical studies in the future to determine the impact of nononcogenic HPV types on the outcome of HPV infections.
Enterococcus faecalis is commonly isolated from the gastrointestinal tract of healthy infants and adults, where it contributes to host health and well-being. We describe here the draft genome sequence of E. faecalis PC1.1, a candidate probiotic strain isolated from human feces.
The complete genome sequence of the radiation resistant bacterium Deinococcus radiodurans R1 is composed of two chromosomes (2,648,615 and 412,340 basepairs), a megaplasmid (177,466 basepairs), and a small plasmid (45,702 basepairs) yielding a total genome of 3,284,123 basepairs. Multiple components distributed on the chromosomes and megaplasmid that contribute to the ability of D. radiodurans to survive under conditions of starvation, oxidative stress, and high levels of DNA-damage have been identified. D. radiodurans represents an organism in which all systems for DNA repair, DNA damage export, desiccation and starvation recovery, and genetic redundancy are present in one cell.
This manuscript calls for an international effort to generate a comprehensive catalog from genome sequences of all the archaeal and bacterial type strains.
Microbes hold the key to life. They hold the secrets to our past (as the descendants of the earliest forms of life) and the prospects for our future (as we mine their genes for solutions to some of the planet's most pressing problems, from global warming to antibiotic resistance). However, the piecemeal approach that has defined efforts to study microbial genetic diversity for over 20 years and in over 30,000 genome projects risks squandering that promise. These efforts have covered less than 20% of the diversity of the cultured archaeal and bacterial species, which represent just 15% of the overall known prokaryotic diversity. Here we call for the funding of a systematic effort to produce a comprehensive genomic catalog of all cultured Bacteria and Archaea by sequencing, where available, the type strain of each species with a validly published name (currently∼11,000). This effort will provide an unprecedented level of coverage of our planet's genetic diversity, allow for the large-scale discovery of novel genes and functions, and lead to an improved understanding of microbial evolution and function in the environment.
The gastrointestinal (GI) microbiome contributes significantly to host nutrition and health. However, relationships involving GI microbes, their hosts and host macrohabitats remain to be established. Here, we define clear patterns of variation in the GI microbiomes of six groups of Mexican black howler monkeys (Alouatta pigra) occupying a gradation of habitats including a continuous evergreen rainforest, an evergreen rainforest fragment, a continuous semi-deciduous forest and captivity. High throughput microbial 16S ribosomal RNA gene sequencing indicated that diversity, richness and composition of howler GI microbiomes varied with host habitat in relation to diet. Howlers occupying suboptimal habitats consumed less diverse diets and correspondingly had less diverse gut microbiomes. Quantitative real-time PCR also revealed a reduction in the number of genes related to butyrate production and hydrogen metabolism in the microbiomes of howlers occupying suboptimal habitats, which may impact host health.
Alouatta; gut microbiome; habitat; primate
High throughput sequencing has accelerated the determination of genome sequences for thousands of human infectious disease pathogens and dozens of their vectors. The scale and scope of these data are enabling genotype-phenotype association studies to identify genetic determinants of pathogen virulence and drug/insecticide resistance, and phylogenetic studies to track the origin and spread of disease outbreaks. To maximize the utility of genomic sequences for these purposes, it is essential that metadata about the pathogen/vector isolate characteristics be collected and made available in organized, clear, and consistent formats. Here we report the development of the GSCID/BRC Project and Sample Application Standard, developed by representatives of the Genome Sequencing Centers for Infectious Diseases (GSCIDs), the Bioinformatics Resource Centers (BRCs) for Infectious Diseases, and the U.S. National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), informed by interactions with numerous collaborating scientists. It includes mapping to terms from other data standards initiatives, including the Genomic Standards Consortium’s minimal information (MIxS) and NCBI’s BioSample/BioProjects checklists and the Ontology for Biomedical Investigations (OBI). The standard includes data fields about characteristics of the organism or environmental source of the specimen, spatial-temporal information about the specimen isolation event, phenotypic characteristics of the pathogen/vector isolated, and project leadership and support. By modeling metadata fields into an ontology-based semantic framework and reusing existing ontologies and minimum information checklists, the application standard can be extended to support additional project-specific data fields and integrated with other data represented with comparable standards. The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.
To assess the vaginal microbiome throughout full-term uncomplicated pregnancy.
Vaginal swabs were obtained from twelve pregnant women at 8-week intervals throughout their uncomplicated pregnancies. Patients with symptoms of vaginal infection or with recent antibiotic use were excluded. Swabs were obtained from the posterior fornix and cervix at 8–12, 17–21, 27–31, and 36–38 weeks of gestation. The microbial community was profiled using hypervariable tag sequencing of the V3–V5 region of the 16S rRNA gene, producing approximately 8 million reads on the Illumina MiSeq.
Samples were dominated by a single genus, Lactobacillus, and exhibited low species diversity. For a majority of the patients (n = 8), the vaginal microbiome was dominated by Lactobacillus crispatus throughout pregnancy. Two patients showed Lactobacillus iners dominance during the course of pregnancy, and two showed a shift between the first and second trimester from L. crispatus to L. iners dominance. In all of the samples only these two species were identified, and were found at an abundance of higher than 1% in this study. Comparative analyses also showed that the vaginal microbiome during pregnancy is characterized by a marked dominance of Lactobacillus species in both Caucasian and African-American subjects. In addition, our Caucasian subject population clustered by trimester and progressed towards a common attractor while African-American women clustered by subject instead and did not progress towards a common attractor.
Our analyses indicate normal pregnancy is characterized by a microbiome that has low diversity and high stability. While Lactobacillus species strongly dominate the vaginal environment during pregnancy across the two studied ethnicities, observed differences between the longitudinal dynamics of the analyzed populations may contribute to divergent risk for pregnancy complications. This helps establish a baseline for investigating the role of the microbiome in complications of pregnancy such as preterm labor and preterm delivery.
Enterococcus faecium is commonly isolated from the human gastrointestinal tract; however, important intraspecies variations exist with relevance for host health and well-being. Here, we describe the draft genome sequence of E. faecium PC4.1, a clade B strain isolated from human feces.
The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host–parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.
Our knowledge of microbial diversity in the human oral cavity has vastly expanded during the last two decades of research. However, much of what is known about the behavior of oral species to date derives from pure culture approaches and the studies combining several cultivated species, which likely does not fully reflect their function in complex microbial communities. It has been shown in studies with a limited number of cultivated species that early oral biofilm development occurs in a successional manner and that continuous low pH can lead to an enrichment of aciduric species. Observations that in vitro grown plaque biofilm microcosms can maintain similar pH profiles in response to carbohydrate addition as plaque in vivo suggests a complex microbial community can be established in the laboratory. In light of this, our primary goal was to develop a robust in vitro biofilm-model system from a pooled saliva inoculum in order to study the stability, reproducibility, and development of the oral microbiome, and its dynamic response to environmental changes from the community to the molecular level.
Comparative metagenomic analyses confirmed a high similarity of metabolic potential in biofilms to recently available oral metagenomes from healthy subjects as part of the Human Microbiome Project. A time-series metagenomic analysis of the taxonomic community composition in biofilms revealed that the proportions of major species at 3 hours of growth are maintained during 48 hours of biofilm development. By employing deep pyrosequencing of the 16S rRNA gene to investigate this biofilm model with regards to bacterial taxonomic diversity, we show a high reproducibility of the taxonomic carriage and proportions between: 1) individual biofilm samples; 2) biofilm batches grown at different dates; 3) DNA extraction techniques and 4) research laboratories.
Our study demonstrates that we now have the capability to grow stable oral microbial in vitro biofilms containing more than one hundred operational taxonomic units (OTU) which represent 60-80% of the original inoculum OTU richness. Previously uncultivated Human Oral Taxa (HOT) were identified in the biofilms and contributed to approximately one-third of the totally captured 16S rRNA gene diversity. To our knowledge, this represents the highest oral bacterial diversity reported for an in vitro model system so far. This robust model will help investigate currently uncultivated species and the known virulence properties for many oral pathogens not solely restricted to pure culture systems, but within multi-species biofilms.
In vitro model; Biofilm; Oral microbiome; Saliva; Streptococcus; Lactobacillus; Uncultivated bacteria
Elucidating the biogeography of bacterial communities on the human body is critical for establishing healthy baselines from which to detect differences associated with disease; however, little is known about the baseline bacterial profiles from various human habitats of healthy Chinese undergraduates.
Using parallel barcoded 454 pyrosequencing targeting on the 16S rRNA gene V3 region, the bacterial diversity of the nasopharynx, saliva, dominant hands, and feces were investigated from 10 healthy Chinese junior boarding undergraduates at Zhejiang University. The participants were 21–24 years of age with a body mass index (BMI) < 24 kg/m2. A total of 156,717 high-quality pyrosequencing reads were obtained for evaluating bacterial diversity, which represented 29,887 unique phylotypes. The overall taxonomic distribution of the 16S rRNA gene-based amplicons demonstrated that these 4 habitats of the human body harbored distinct microbiota and could be divided into different clusters according to anatomic site, while the established patterns of bacterial diversity followed the human body habitat (feces, hands, saliva, and nasopharynx). Although significant inter-individual variation was observed, the healthy microbiota still shared a large number of phylotypes in each habitat, but not among the four habitats, indicating that a core microbiome existed in each healthy habitat. The vast majority of sequences from these different habitats were classified into different taxonmies that became the predominant bacteria of the healthy microbiota.
We first established the framework of microbial communities from four healthy human habitats of the same participants with similar living environments for the Chinese undergraduates. Our data represent an important step for determining the diversity of Chinese healthy microbiota, and can be used for more large-scale studies that focus on the interactions between healthy and diseases states for young Chinese adults in the same age range.
Healthy microbiota; Saliva; Nasopharynx; Feces; Dominant hand; Pyrosequencing; Core microbiome
Background & Aims
Intestinal dysbiosis and bacterial translocation is common in patients with advanced liver disease, and there is strong evidence that the translocation of bacteria and their products across the epithelial barrier drives experimental liver disease progression. The aims of our study were to investigate dynamics of bacterial translocation and changes in the enteric microbiome in early stages of liver disease.
Cholestatic liver injury was induced by ligation of the common bile duct (BDL) and toxic liver injury by injection of carbon tetrachloride (CCl4) in mice.
Increased intestinal permeability and bacterial translocation occurred one day following liver injury in both disease models. This was accompanied by decreased intestinal expression of the tight junction protein occludin. Although BDL resulted in a rapid onset of intestinal bacterial overgrowth, bacterial overgrowth was observed in mice injected with CCl4 only in advanced stages of liver fibrosis. To further assess the qualitative changes in the intestinal microbiome, massively parallel pyrosequencing of 16S rRNA genes revealed minor microbial changes following BDL, while CCl4 administration resulted in a relative abundance of Firmicutes and Actinobacteria compared with oil injected mice. Four different liver disease models (cholestasis, toxic, alcohol, obesity) show few similarities in their intestinal microbiome.
Acute liver injury is associated with an early onset of increased intestinal permeability and bacterial translocation that precede changes in the microbiome. The enteric microbiome differs with respect to the etiology of liver disease.
bacterial overgrowth; microbiome; dysbiosis; bacterial translocation; 16S rRNA sequencing
Bacterial vaginosis (BV) is the most common vaginal disorder of reproductive-age women. Yet the cause of BV has not been established. To uncover key determinants of BV, we employed a multi-omic, systems-biology approach, including both deep 16S rRNA gene-based sequencing and metabolomics of lavage samples from 36 women. These women varied demographically, behaviorally, and in terms of health status and symptoms.
16S rRNA gene-based community composition profiles reflected Nugent scores, but not Amsel criteria. In contrast, metabolomic profiles were markedly more concordant with Amsel criteria. Metabolomic profiles revealed two distinct symptomatic BV types (SBVI and SBVII) with similar characteristics that indicated disruption of epithelial integrity, but each type was correlated to the presence of different microbial taxa and metabolites, as well as to different host behaviors. The characteristic odor associated with BV was linked to increases in putrescine and cadaverine, which were both linked to Dialister spp. Additional correlations were seen with the presence of discharge, 2-methyl-2-hydroxybutanoic acid, and Mobiluncus spp., and with pain, diethylene glycol and Gardnerella spp.
The results not only provide useful diagnostic biomarkers, but also may ultimately provide much needed insight into the determinants of BV.
A variety of microbial communities and their genes (microbiome) exist throughout the human body, playing fundamental roles in human health and disease. The NIH funded Human Microbiome Project (HMP) Consortium has established a population-scale framework which catalyzed significant development of metagenomic protocols resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 to 18 body sites up to three times, which to date, have generated 5,177 microbial taxonomic profiles from 16S rRNA genes and over 3.5 Tb of metagenomic sequence. In parallel, approximately 800 human-associated reference genomes have been sequenced. Collectively, these data represent the largest resource to date describing the abundance and variety of the human microbiome, while providing a platform for current and future studies.
A combination of Sanger and 454 sequences of small subunit rRNA loci were used to interrogate microbial diversity in the bovine rumen of 12 cows consuming a forage diet. Observed bacterial species richness, based on the V1–V3 region of the 16S rRNA gene, was between 1,903 to 2,432 species-level operational taxonomic units (OTUs) when 5,520 reads were sampled per animal. Eighty percent of species-level OTUs were dominated by members of the order Clostridiales, Bacteroidales, Erysipelotrichales and unclassified TM7. Abundance of Prevotella species varied widely among the 12 animals. Archaeal species richness, also based on 16S rRNA, was between 8 and 13 OTUs, representing 5 genera. The majority of archaeal OTUs (84%) found in this study were previously observed in public databases with only two new OTUs discovered. Observed rumen fungal species richness, based on the 18S rRNA gene, was between 21 and 40 OTUs with 98.4–99.9% of OTUs represented by more than one read, using Good’s coverage. Examination of the fungal community identified numerous novel groups. Prevotella and Tannerella were overrepresented in the liquid fraction of the rumen while Butyrivibrio and Blautia were significantly overrepresented in the solid fraction of the rumen. No statistical difference was observed between the liquid and solid fractions in biodiversity of archaea and fungi. The survey of microbial communities and analysis of cross-domain correlations suggested there is a far greater extent of microbial diversity in the bovine rumen than previously appreciated, and that next generation sequencing technologies promise to reveal novel species, interactions and pathways that can be studied further in order to better understand how rumen microbial community structure and function affects ruminant feed efficiency, biofuel production, and environmental impact.
Comparative genomic hybridization was used to compare genetic diversity of five strains of Leptospira (Leptospira interrogans serovars Bratislava, Canicola, and Hebdomadis and Leptospira kirschneri serovars Cynopteri and Grippotyphosa). The array was designed based on two available sequenced Leptospira reference genomes, those of L. interrogans serovar Copenhageni and L. interrogans serovar Lai. A comparison of genetic contents showed that L. interrogans serovar Bratislava was closest to the reference genomes while L. kirschneri serovar Grippotyphosa had the least similarity to the reference genomes. Cluster analysis indicated that L. interrogans serovars Bratislava and Hebdomadis clustered together first, followed by L. interrogans serovar Canicola, before the two L. kirschneri strains. Confirmed/potential virulence factors identified in previous research were also detected in the tested strains.
Infections of the vaginal tract result from perturbations in the complex interactions between the microbiome and the host vaginal ecosystem. Recent data have linked specific vaginal microbes and urogenital infection with pre-term birth. Here we discuss how next generation sequencing-based approaches to study the vaginal microbiome will be important for defining what constitutes an imbalance of the microbiome and the associated host conditions that lead to subsequent infection and disease states. These studies will provide clinicians reliable diagnostic tools and treatments for women who are at increased risk for vaginal infections, preterm birth, HIV and other sexually acquired diseases, and will provide opportunities for intervention.
Clinical dogma is that healthy urine is sterile and the presence of bacteria with an inflammatory response is indicative of urinary tract infection (UTI). Asymptomatic bacteriuria (ABU) represents the state in which bacteria are present but the inflammatory response is negligible. Differentiating ABU from UTI is diagnostically challenging, but critical because overtreatment of ABU can perpetuate antimicrobial resistance while undertreatment of UTI can result in increased morbidity and mortality. In this study, we describe key characteristics of the healthy and ABU urine microbiomes utilizing 16S rRNA gene (16S rDNA) sequencing and metaproteomics, with the future goal of utilizing this information to personalize the treatment of UTI based on key individual characteristics.
A cross-sectional study of 26 healthy controls and 27 healthy subjects at risk for ABU due to spinal cord injury-related neuropathic bladder (NB) was conducted. Of the 27 subjects with NB, 8 voided normally, 8 utilized intermittent catheterization, and 11 utilized indwelling Foley urethral catheterization for bladder drainage. Urine was obtained by clean catch in voiders, or directly from the catheter in subjects utilizing catheters. Urinalysis, urine culture and 16S rDNA sequencing were performed on all samples, with metaproteomic analysis performed on a subsample.
A total of 589454 quality-filtered 16S rDNA sequence reads were processed through a NextGen 16S rDNA analysis pipeline. Urine microbiomes differ by normal bladder function vs. NB, gender, type of bladder catheter utilized, and duration of NB. The top ten bacterial taxa showing the most relative abundance and change among samples were Lactobacillales, Enterobacteriales, Actinomycetales, Bacillales, Clostridiales, Bacteroidales, Burkholderiales, Pseudomonadales, Bifidobacteriales and Coriobacteriales. Metaproteomics confirmed the 16S rDNA results, and functional human protein-pathogen interactions were noted in subjects where host defenses were initiated.
Counter to clinical belief, healthy urine is not sterile. The healthy urine microbiome is characterized by a preponderance of Lactobacillales in women and Corynebacterium in men. The presence and duration of NB and method of urinary catheterization alter the healthy urine microbiome. An integrated approach of 16S rDNA sequencing with metaproteomics improves our understanding of healthy urine and facilitates a more personalized approach to prevention and treatment of infection.
Bacteriuria; Urine; Catheter; Neuropathic; Bladder; Microbiome; Metaproteome; Next-generation; Personalized; rRNA
This manuscript describes the NIH Human Microbiome Project, including a brief review of human microbiome research, a history of the project, and a comprehensive overview of the consortium's recent collection of publications analyzing the human microbiome.
Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences—the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The ‘environmental packages’ apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and to provide a single point of entry for the scientific community to access and learn about GSC checklists, we present the minimum information about any (x) sequence (MIxS). Adoption of MIxS will enhance our ability to analyze natural genetic diversity documented by massive DNA sequencing efforts from myriad ecosystems in our ever-changing biosphere.
This study is the first to use a metagenomics approach to characterize the phylogeny and functional capacity of the canine gastrointestinal microbiome. Six healthy adult dogs were used in a crossover design and fed a low-fiber control diet (K9C) or one containing 7.5% beet pulp (K9BP). Pooled fecal DNA samples from each treatment were subjected to 454 pyrosequencing, generating 503 280 (K9C) and 505 061 (K9BP) sequences. Dominant bacterial phyla included the Bacteroidetes/Chlorobi group and Firmicutes, both of which comprised ∼35% of all sequences, followed by Proteobacteria (13–15%) and Fusobacteria (7–8%). K9C had a greater percentage of Bacteroidetes, Fusobacteria and Proteobacteria, whereas K9BP had greater proportions of the Bacteroidetes/Chlorobi group and Firmicutes. Archaea were not altered by diet and represented ∼1% of all sequences. All archaea were members of Crenarchaeota and Euryarchaeota, with methanogens being the most abundant and diverse. Three fungi phylotypes were present in K9C, but none in K9BP. Less than 0.4% of sequences were of viral origin, with >99% of them associated with bacteriophages. Primary functional categories were not significantly affected by diet and were associated with carbohydrates; protein metabolism; DNA metabolism; cofactors, vitamins, prosthetic groups and pigments; amino acids and derivatives; cell wall and capsule; and virulence. Hierarchical clustering of several gastrointestinal metagenomes demonstrated phylogenetic and metabolic similarity between dogs, humans and mice. More research is required to provide deeper coverage of the canine microbiome, evaluate effects of age, genetics or environment on its composition and activity, and identify its role in gastrointestinal disease.
canine gut; gastrointestinal bacteria; metagenomics; pyrosequencing
Thermotoga sp. strain RQ2 is probably a strain of Thermotoga maritima. Its complete genome sequence allows for an examination of the extent and consequences of gene flow within Thermotoga species and strains. Thermotoga sp. RQ2 differs from T. maritima in its genes involved in myo-inositol metabolism. Its genome also encodes an apparent fructose phosphotransferase system (PTS) sugar transporter. This operon is also found in Thermotoga naphthophila strain RKU-10 but no other Thermotogales. These are the first reported PTS transporters in the Thermotogales.
A steep drop in the cost of next-generation sequencing during recent years has made the technology affordable to the majority of researchers, but downstream bioinformatic analysis still poses a resource bottleneck for smaller laboratories and institutes that do not have access to substantial computational resources. Sequencing instruments are typically bundled with only the minimal processing and storage capacity required for data capture during sequencing runs. Given the scale of sequence datasets, scientific value cannot be obtained from acquiring a sequencer unless it is accompanied by an equal investment in informatics infrastructure.
Cloud BioLinux is a publicly accessible Virtual Machine (VM) that enables scientists to quickly provision on-demand infrastructures for high-performance bioinformatics computing using cloud platforms. Users have instant access to a range of pre-configured command line and graphical software applications, including a full-featured desktop interface, documentation and over 135 bioinformatics packages for applications including sequence alignment, clustering, assembly, display, editing, and phylogeny. Each tool's functionality is fully described in the documentation directly accessible from the graphical interface of the VM. Besides the Amazon EC2 cloud, we have started instances of Cloud BioLinux on a private Eucalyptus cloud installed at the J. Craig Venter Institute, and demonstrated access to the bioinformatic tools interface through a remote connection to EC2 instances from a local desktop computer. Documentation for using Cloud BioLinux on EC2 is available from our project website, while a Eucalyptus cloud image and VirtualBox Appliance is also publicly available for download and use by researchers with access to private clouds.
Cloud BioLinux provides a platform for developing bioinformatics infrastructures on the cloud. An automated and configurable process builds Virtual Machines, allowing the development of highly customized versions from a shared code base. This shared community toolkit enables application specific analysis platforms on the cloud by minimizing the effort required to prepare and maintain them.
Although Bacteroides vulgatus is one of the most prevalent microorganisms in the human gastrointestinal tract, little is known about the genetic potential of this species. Here, we describe the annotated draft genome sequence of B. vulgatus PC510 isolated from human feces.