The sequencing, assembly, and basic analysis of microbial genomes, once a painstaking and expensive undertaking, has become much easier for research labs with access to standard molecular biology and computational tools. However, there are a confusing variety of options available for DNA library preparation and sequencing, and inexperience with bioinformatics can pose a significant barrier to entry for many who may be interested in microbial genomics. The objective of the present study was to design, test, troubleshoot, and publish a simple, comprehensive workflow from the collection of an environmental sample (a swab) to a published microbial genome; empowering even a lab or classroom with limited resources and bioinformatics experience to perform it.
Workflow; Microbial genomics; Genome sequencing; Genome assembly; Bioinformatics
Microbial interaction between human-associated objects and the environments we inhabit may have forensic implications, and the extent to which microbes are shared between individuals inhabiting the same space may be relevant to human health and disease transmission. In this study, two participants sampled the front and back of their cell phones, four different locations on the soles of their shoes, and the floor beneath them every waking hour over a 2-day period. A further 89 participants took individual samples of their shoes and phones at three different scientific conferences.
Samples taken from different surface types maintained significantly different microbial community structures. The impact of the floor microbial community on that of the shoe environments was strong and immediate, as evidenced by Procrustes analysis of shoe replicates and significant correlation between shoe and floor samples taken at the same time point. Supervised learning was highly effective at determining which participant had taken a given shoe or phone sample, and a Bayesian method was able to determine which participant had taken each shoe sample based entirely on its similarity to the floor samples. Both shoe and phone samples taken by conference participants clustered into distinct groups based on location, though much more so when an unweighted distance metric was used, suggesting sharing of low-abundance microbial taxa between individuals inhabiting the same space.
Correlations between microbial community sources and sinks allow for inference of the interactions between humans and their environment.
Electronic supplementary material
The online version of this article (doi:10.1186/s40168-015-0082-9) contains supplementary material, which is available to authorized users.
Forensic microbiology; Source-sink dynamics; Shoe microbiome; Phone microbiome; Microbial time series
We present the draft genome sequences for 26 strains of Porphyromonas (P. canoris, P. gulae, P. cangingavalis, P. macacae, and 7 unidentified) and an unidentified member of the Porphyromonadaceae family. All of these strains were isolated from the canine oral cavity, from dogs with and without early periodontal disease.
Here, we present the complete 2,003,803-bp genome of a sulfate-reducing thermophilic bacterium, Thermodesulfovibrio yellowstonii strain DSM 11347T.
Here, we present the complete genome sequence of Thermodesulfobacterium commune DSM 2178T of the phylum Thermodesulfobacteria.
Here, we present the draft genome of Burkholderia gladioli strain UCD-UG_CHAPALOTE. This strain is an endophyte isolated from surface sterilized seeds of an ancient Mexican landrace of corn, Chapalote. The genome contains 8,527,129 bp in 109 scaffolds.
Here, we present the draft genome of Enterobacter sp. strain UCD-UG_FMILLET. This strain is an endophyte isolated from the roots of finger millet, an Afro-Indian cereal crop. The genome contains 4,801,411 bp in 53 scaffolds.
Objective: The emergence of extended-spectrum β-lactamase (ESBL)-producing bacteria is now a critical concern. The ESBL-producing Klebsiella pneumoniae constitutes one of the most common multidrug-resistant (MDR) groups of gram-negative bacteria involved in nosocomial infections worldwide. In this study we report on the molecular characterization through whole genome sequencing of an ESBL-producing K. pneumoniae strain, LAU-KP1, isolated from a stool sample from a patient admitted for a gastrointestinal procedure/surgery at the Lebanese Amrican University Medical Center-Rizk Hospital (LAUMCRH) in Lebanon.
Methods: Illumina paired-end libraries were prepared and sequenced, which resulted in 4,220,969 high-quality reads. All sequence processing and assembly were performed using the A5 assembly pipeline.
Results: The initial assembly produced 86 contigs, for which no scaffolding was obtained. The final collection of contigs was submitted to GenBank. The final draft genome sequence consists of a combined 5,632,663 bases with 57% G+C content. Automated annotation was performed using the RAST annotation server. Sequencing analysis revealed that the isolate harbored different β-lactamase genes, including blaoxa−1, blaCTX−M−15, blaSHV−11, and blaTEM−1b. The isolate was also characterized by the concomitant presence of other resistance determinants most notably acc(6′)-lb-cr and qnrb1. The entire plasmid content was also investigated and revealed homology with four major plasmids pKPN-IT, pBS512_2, pRSF1010_SL1344, and pKPN3.
Conclusions: The potential role of K. pneumonia as a reservoir for ESBL genes and other resistance determinants is along with the presence of key factors that favor the spread of antimicrobial resistance a clear cause of concern and the problem that Carbapenem-non-susceptible ESBL isolates are posing in hospitals should be reconsidered through systematic exploration and molecular characterization.
ESBL; Klebsiella pneumoniae; whole genome sequencing; CTX-M-15; SHV-11
The Genomic Encyclopedia of Bacteria and Archaea (GEBA) project was launched by the JGI in 2007 as a pilot project to sequence about 250 bacterial and archaeal genomes of elevated phylogenetic diversity. Herein, we propose to extend this approach to type strains of prokaryotes associated with soil or plants and their close relatives as well as type strains from newly described species. Understanding the microbiology of soil and plants is critical to many DOE mission areas, such as biofuel production from biomass, biogeochemistry, and carbon cycling. We are also targeting type strains of novel species while they are being described. Since 2006, about 630 new species have been described per year, many of which are closely aligned to DOE areas of interest in soil, agriculture, degradation of pollutants, biofuel production, biogeochemical transformation, and biodiversity.
Genome sequencing; Type stains; Prokaryotes
We present the draft genome sequences of six strains of Escherichia coli isolated from blood cultures collected from patients with sepsis. The strains were collected from two patient sets, those with a high severity of illness, and those with a low severity of illness. Each genome was sequenced by both Illumina and PacBio for comparison.
Far more attention has been paid to the microbes in our feces than the microbes in our food. Research efforts dedicated to the microbes that we eat have historically been focused on a fairly narrow range of species, namely those which cause disease and those which are thought to confer some “probiotic” health benefit. Little is known about the effects of ingested microbial communities that are present in typical American diets, and even the basic questions of which microbes, how many of them, and how much they vary from diet to diet and meal to meal, have not been answered.
We characterized the microbiota of three different dietary patterns in order to estimate: the average total amount of daily microbes ingested via food and beverages, and their composition in three daily meal plans representing three different dietary patterns. The three dietary patterns analyzed were: (1) the Average American (AMERICAN): focused on convenience foods, (2) USDA recommended (USDA): emphasizing fruits and vegetables, lean meat, dairy, and whole grains, and (3) Vegan (VEGAN): excluding all animal products. Meals were prepared in a home kitchen or purchased at restaurants and blended, followed by microbial analysis including aerobic, anaerobic, yeast and mold plate counts as well as 16S rRNA PCR survey analysis.
Based on plate counts, the USDA meal plan had the highest total amount of microbes at 1.3 × 109 CFU per day, followed by the VEGAN meal plan and the AMERICAN meal plan at 6 × 106 and 1.4 × 106 CFU per day respectively. There was no significant difference in diversity among the three dietary patterns. Individual meals clustered based on taxonomic composition independent of dietary pattern. For example, meals that were abundant in Lactic Acid Bacteria were from all three dietary patterns. Some taxonomic groups were correlated with the nutritional content of the meals. Predictive metagenome analysis using PICRUSt indicated differences in some functional KEGG categories across the three dietary patterns and for meals clustered based on whether they were raw or cooked.
Further studies are needed to determine the impact of ingested microbes on the intestinal microbiota, the extent of variation across foods, meals and diets, and the extent to which dietary microbes may impact human health. The answers to these questions will reveal whether dietary microbes, beyond probiotics taken as supplements—i.e., ingested with food—are important contributors to the composition, inter-individual variation, and function of our gut microbiota.
16S; Microbial ecology; Microbiota; Microbiome; Bioinformatics; Microbial communities; Food microbiology; QIIME; PICRUSt; Illumina amplicon sequencing
Organisms across the tree of life use a variety of mechanisms to respond to stress-inducing fluctuations in osmotic conditions. Cellular response mechanisms and phenotypes associated with osmoadaptation also play important roles in bacterial virulence, human health, agricultural production and many other biological systems. To improve understanding of osmoadaptive strategies, we have generated 59 high-quality draft genomes for the haloarchaea (a euryarchaeal clade whose members thrive in hypersaline environments and routinely experience drastic changes in environmental salinity) and analyzed these new genomes in combination with those from 21 previously sequenced haloarchaeal isolates. We propose a generalized model for haloarchaeal management of cytoplasmic osmolarity in response to osmotic shifts, where potassium accumulation and sodium expulsion during osmotic upshock are accomplished via secondary transport using the proton gradient as an energy source, and potassium loss during downshock is via a combination of secondary transport and non-specific ion loss through mechanosensitive channels. We also propose new mechanisms for magnesium and chloride accumulation. We describe the expansion and differentiation of haloarchaeal general transcription factor families, including two novel expansions of the TATA-binding protein family, and discuss their potential for enabling rapid adaptation to environmental fluxes. We challenge a recent high-profile proposal regarding the evolutionary origins of the haloarchaea by showing that inclusion of additional genomes significantly reduces support for a proposed large-scale horizontal gene transfer into the ancestral haloarchaeon from the bacterial domain. The combination of broad (17 genera) and deep (≥5 species in four genera) sampling of a phenotypically unified clade has enabled us to uncover both highly conserved and specialized features of osmoadaptation. Finally, we demonstrate the broad utility of such datasets, for metagenomics, improvements to automated gene annotation and investigations of evolutionary processes.
The ability to adjust to changing osmotic conditions (osmoadaptation) is crucial to the survival of organisms across the tree of life. However, significant gaps still exist in our understanding of this important phenomenon. To help fill some of these gaps, we have produced high-quality draft genomes for 59 osmoadaptation “experts” (extreme halophiles of the euryarchaeal family Halobacteriaceae). We describe the dispersal of osmoadaptive protein families across the haloarchaeal evolutionary tree. We use this data to suggest a generalized model for haloarchaeal ion transport in response to changing osmotic conditions, including proposed new mechanisms for magnesium and chloride accumulation. We describe the evolutionary expansion and differentiation of haloarchaeal general transcription factor families and discuss their potential for enabling rapid adaptation to environmental fluxes. Lastly, we challenge a recent high-profile proposal regarding the evolutionary origins of the haloarchaea by showing that inclusion of additional genomes significantly reduces support for a proposed large-scale horizontal gene transfer into the ancestral haloarchaeon from the bacterial domain. This result highlights the power of our dataset for making evolutionary inferences, a feature which will make it useful to the broader evolutionary community. We distribute our genomic dataset through a user-friendly graphical interface.
Symbioses between chemoautotrophic bacteria and marine invertebrates are rare examples of living systems that are virtually independent of photosynthetic primary production. These associations have evolved multiple times in marine habitats, such as deep-sea hydrothermal vents and reducing sediments, characterized by steep gradients of oxygen and reduced chemicals. Due to difficulties associated with maintaining these symbioses in the laboratory and culturing the symbiotic bacteria, studies of chemosynthetic symbioses rely heavily on culture independent methods. The symbiosis between the coastal bivalve, Solemya velum, and its intracellular symbiont is a model for chemosynthetic symbioses given its accessibility in intertidal environments and the ability to maintain it under laboratory conditions. To better understand this symbiosis, the genome of the S. velum endosymbiont was sequenced.
Relative to the genomes of obligate symbiotic bacteria, which commonly undergo erosion and reduction, the S. velum symbiont genome was large (2.7 Mb), GC-rich (51%), and contained a large number (78) of mobile genetic elements. Comparative genomics identified sets of genes specific to the chemosynthetic lifestyle and necessary to sustain the symbiosis. In addition, a number of inferred metabolic pathways and cellular processes, including heterotrophy, branched electron transport, and motility, suggested that besides the ability to function as an endosymbiont, the bacterium may have the capacity to live outside the host.
The physiological dexterity indicated by the genome substantially improves our understanding of the genetic and metabolic capabilities of the S. velum symbiont and the breadth of niches the partners may inhabit during their lifecycle.
Electronic supplementary material
The online version of this article (doi:10.1186/1471-2164-15-924) contains supplementary material, which is available to authorized users.
Symbiosis; Chemosynthesis; Sulfur oxidation; Respiratory flexibility; H+/Na+ -membrane cycles; Calvin cycle; Pyrophosphate-dependent phosphofructokinase; Heterotrophy; Motility; Mobile genetic elements
The complete genome sequence of the radiation resistant bacterium Deinococcus radiodurans R1 is composed of two chromosomes (2,648,615 and 412,340 basepairs), a megaplasmid (177,466 basepairs), and a small plasmid (45,702 basepairs) yielding a total genome of 3,284,123 basepairs. Multiple components distributed on the chromosomes and megaplasmid that contribute to the ability of D. radiodurans to survive under conditions of starvation, oxidative stress, and high levels of DNA-damage have been identified. D. radiodurans represents an organism in which all systems for DNA repair, DNA damage export, desiccation and starvation recovery, and genetic redundancy are present in one cell.
Here we present the draft genome of Synergistes jonesii 78-1, ATCC 49833, a member of the Synergistes phylum. This organism was isolated from the rumen of a Hawaiian goat and ferments pyridinediols. The assembly contains 2,747,397 bp in 61 contigs.
This manuscript calls for an international effort to generate a comprehensive catalog from genome sequences of all the archaeal and bacterial type strains.
Microbes hold the key to life. They hold the secrets to our past (as the descendants of the earliest forms of life) and the prospects for our future (as we mine their genes for solutions to some of the planet's most pressing problems, from global warming to antibiotic resistance). However, the piecemeal approach that has defined efforts to study microbial genetic diversity for over 20 years and in over 30,000 genome projects risks squandering that promise. These efforts have covered less than 20% of the diversity of the cultured archaeal and bacterial species, which represent just 15% of the overall known prokaryotic diversity. Here we call for the funding of a systematic effort to produce a comprehensive genomic catalog of all cultured Bacteria and Archaea by sequencing, where available, the type strain of each species with a validly published name (currently∼11,000). This effort will provide an unprecedented level of coverage of our planet's genetic diversity, allow for the large-scale discovery of novel genes and functions, and lead to an improved understanding of microbial evolution and function in the environment.
Here, we present the draft genome of the endosymbiont “Candidatus Ruthia magnifica” UCD-CM, a member of the phylum Proteobacteria, found from the gills of a deep-sea giant clam, Calyptogena magnifica. The assembly consists of 1,160,249 bp contained in 18 contigs.
The Microbiology of the Built Environment Network (microBEnet) has served as an experiment in online community building. Here we discuss strategies used to launch a new, interdisciplinary scientific field, and their implications.
Thermotoga thermarum Windberger et al. 1989 is a member to the genomically well characterized genus Thermotoga in the phylum ‘Thermotogae’. T. thermarum is of interest for its origin from a continental solfataric spring vs. predominantly marine oil reservoirs of other members of the genus. The genome of strain LA3T also provides fresh data for the phylogenomic positioning of the (hyper-)thermophilic bacteria. T. thermarum strain LA3T is the fourth sequenced genome of a type strain from the genus Thermotoga, and the sixth in the family Thermotogaceae to be formally described in a publication. Phylogenetic analyses do not reveal significant discrepancies between the current classification of the group, 16S rRNA gene data and whole-genome sequences. Nevertheless, T. thermarum significantly differs from other Thermotoga species regarding its iron-sulfur cluster synthesis, as it contains only a minimal set of the necessary proteins. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,039,943 bp long chromosome with its 2,015 protein-coding and 51 RNA genes is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
anaerobic; motile; thermophilic; chemoorganotrophic; solfataric spring; outer sheath-like structure; Thermotogaceae; GEBA
Metagenomics is a valuable tool for the study of microbial communities but has been limited by the difficulty of “binning” the resulting sequences into groups corresponding to the individual species and strains that constitute the community. Moreover, there are presently no methods to track the flow of mobile DNA elements such as plasmids through communities or to determine which of these are co-localized within the same cell. We address these limitations by applying Hi-C, a technology originally designed for the study of three-dimensional genome structure in eukaryotes, to measure the cellular co-localization of DNA sequences. We leveraged Hi-C data generated from a simple synthetic metagenome sample to accurately cluster metagenome assembly contigs into groups that contain nearly complete genomes of each species. The Hi-C data also reliably associated plasmids with the chromosomes of their host and with each other. We further demonstrated that Hi-C data provides a long-range signal of strain-specific genotypes, indicating such data may be useful for high-resolution genotyping of microbial populations. Our work demonstrates that Hi-C sequencing data provide valuable information for metagenome analyses that are not currently obtainable by other methods. This metagenomic Hi-C method could facilitate future studies of the fine-scale population structure of microbes, as well as studies of how antibiotic resistance plasmids (or other genetic elements) mobilize in microbial communities. The method is not limited to microbiology; the genetic architecture of other heterogeneous populations of cells could also be studied with this technique.
Hi-C; Microbial ecology; Metagenomics; Plasmids; Synthetic microbial communities; Markov clustering; Metagenome assembly; Strain differentiation; Haplotype phasing; Genome scaffolding
We present the draft genome sequences of nine clinical Streptococcus pyogenes isolates recovered from patients suffering from sore throat and skin infections. An average of 2,454,334 paired-end reads per sample were generated, which assembled into 21 to 198 contigs, with a G+C content of 38.4 to 38.5%.
Here we present the complete 1,424,912-bp genome sequence of Coprothermobacter proteolyticus DSM 5265, isolated from a thermophilic digester fermenting tannery wastes and cattle manure.
Here we present the draft genome of Tatumella sp. strain UCD-D_suzukii, the first member of this genus to be sequenced. The genome contains 3,602,931 bp in 72 scaffolds. This strain was isolated from Drosophila suzukii larvae as part of a larger project to study the microbiota of D. suzukii.
Methanoplanus limicola Wildgruber et al. 1984 is a mesophilic methanogen that was isolated from a swamp composed of drilling waste near Naples, Italy, shortly after the Archaea were recognized as a separate domain of life. Methanoplanus is the type genus in the family Methanoplanaceae, a taxon that felt into disuse since modern 16S rRNA gene sequences-based taxonomy was established. Methanoplanus is now placed within the Methanomicrobiaceae, a family that is so far poorly characterized at the genome level. The only other type strain of the genus with a sequenced genome, Methanoplanus petrolearius SEBR 4847T, turned out to be misclassified and required reclassification to Methanolacinia. Both, Methanoplanus and Methanolacinia, needed taxonomic emendations due to a significant deviation of the G+C content of their genomes from previously published (pre-genome-sequence era) values. Until now genome sequences were published for only four of the 33 species with validly published names in the Methanomicrobiaceae. Here we describe the features of M. limicola, together with the improved-high-quality draft genome sequence and annotation of the type strain, M3T. The 3,200,946 bp long chromosome (permanent draft sequence) with its 3,064 protein-coding and 65 RNA genes is a part of the Genomic
anaerobic; motile; mesophilic; methanogen; swamp; improved-high-quality draft; Methanomicrobiaceae; GEBA
Here, we present the complete genome of the extreme thermophile, Dictyoglomus thermophilum H-6-12 (phylum Dictyoglomi), which consists of 1,959,987 bp.