Zymomonas mobilis is an ethanologenic bacterium that has been studied for use in biofuel production. Of the sequenced Zymomonas strains, ATCC 29191 has been described as the phenotypic centrotype of Zymomonas mobilis subsp. mobilis, the taxon that harbors the highest ethanol-producing Z. mobilis strains. ATCC 29191 was isolated in Kinshasa, Congo, from palm wine fermentations. This strain is reported to be a robust levan producer, while in recent years it has been employed in studies addressing Z. mobilis respiration. Here we announce the finishing and annotation of the ATCC 29191 genome, which comprises one chromosome and three plasmids.
Marinitoga piezophila KA3 is a thermophilic, anaerobic, chemoorganotrophic, sulfur-reducing bacterium isolated from the Grandbonum deep-sea hydrothermal vent site at the East Pacific Rise (13°N, 2,630-m depth). The genome of M. piezophila KA3 comprises a 2,231,407-bp circular chromosome and a 13,386-bp circular plasmid. This genome was sequenced within Department of Energy Joint Genome Institute CSP 2010.
Desulfotomaculum kuznetsovii is a moderately thermophilic member of the polyphyletic spore-forming genus Desulfotomaculum in the family Peptococcaceae. This species is of interest because it originates from deep subsurface thermal mineral water at a depth of about 3,000 m. D. kuznetsovii is a rather versatile bacterium as it can grow with a large variety of organic substrates, including short-chain and long-chain fatty acids, which are degraded completely to carbon dioxide coupled to the reduction of sulfate. It can grow methylotrophically with methanol and sulfate and autotrophically with H2 + CO2 and sulfate. For growth it does not require any vitamins. Here, we describe the features of D. kuznetsovii together with the genome sequence and annotation. The chromosome has 3,601,386 bp organized in one contig. A total of 3,567 candidate protein-encoding genes and 58 RNA genes were identified. Genes of the acetyl-CoA pathway, possibly involved in heterotrophic growth with acetate and methanol, and in CO2 fixation during autotrophic growth are present. Genomic comparison revealed that D. kuznetsovii shows a high similarity with Pelotomaculum thermopropionicum. Genes involved in propionate metabolism of these two strains show a strong similarity. However, main differences are found in genes involved in the electron acceptor metabolism.
Thermophilic spore-forming anaerobes; sulfate reduction; autotrophic; methylotrophic; Peptococcaceae; Clostridiales
Termites effectively feed on many types of lignocellulose assisted by their gut microbial symbionts. To better understand the microbial decomposition of biomass with varied chemical profiles, it is important to determine whether termites harbor different microbial symbionts with specialized functionalities geared toward different feeding regimens. In this study, we compared the microbiota in the hindgut paunch of Amitermes wheeleri collected from cow dung and Nasutitermes corniger feeding on sound wood by 16S rRNA pyrotag, comparative metagenomic and metatranscriptomic analyses. We found that Firmicutes and Spirochaetes were the most abundant phyla in A. wheeleri, in contrast to N. corniger where Spirochaetes and Fibrobacteres dominated. Despite this community divergence, a convergence was observed for functions essential to termite biology including hydrolytic enzymes, homoacetogenesis and cell motility and chemotaxis. Overrepresented functions in A. wheeleri relative to N. corniger microbiota included hemicellulose breakdown and fixed-nitrogen utilization. By contrast, glycoside hydrolases attacking celluloses and nitrogen fixation genes were overrepresented in N. corniger microbiota. These observations are consistent with dietary differences in carbohydrate composition and nutrient contents, but may also reflect the phylogenetic difference between the hosts.
Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20 × coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.
metagenome; assembly; Illumina
Serratia sp. strain FGI 94 was isolated from a fungus garden of the leaf-cutter ant Atta colombica. Analysis of its 4.86-Mbp chromosome will help advance our knowledge of symbiotic interactions and plant biomass degradation in this ancient ant-fungus mutualism.
The Enterobacteriaceae bacterium strain FGI 57 was isolated from a fungus garden of the leaf-cutter ant Atta colombica. Analysis of its single 4.76-Mbp chromosome will shed light on community dynamics and plant biomass degradation in ant fungus gardens.
Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.
Metagenome analysis of the gut symbionts of three different insects was conducted as a means of comparing taxonomic and metabolic diversity of gut microbiomes to diet and life history of the insect hosts. A second goal was the discovery of novel biocatalysts for biorefinery applications. Grasshopper and cutworm gut symbionts were sequenced and compared with the previously identified metagenome of termite gut microbiota. These insect hosts represent three different insect orders and specialize on different food types. The comparative analysis revealed dramatic differences among the three insect species in the abundance and taxonomic composition of the symbiont populations present in the gut. The composition and abundance of symbionts was correlated with their previously identified capacity to degrade and utilize the different types of food consumed by their hosts. The metabolic reconstruction revealed that the gut metabolome of cutworms and grasshoppers was more enriched for genes involved in carbohydrate metabolism and transport than wood-feeding termite, whereas the termite gut metabolome was enriched for glycosyl hydrolase (GH) enzymes relevant to lignocellulosic biomass degradation. Moreover, termite gut metabolome was more enriched with nitrogen fixation genes than those of grasshopper and cutworm gut, presumably due to the termite's adaptation to the high fiber and less nutritious food types. In order to evaluate and exploit the insect symbionts for biotechnology applications, we cloned and further characterized four biomass-degrading enzymes including one endoglucanase and one xylanase from both the grasshopper and cutworm gut symbionts. The results indicated that the grasshopper symbiont enzymes were generally more efficient in biomass degradation than the homologous enzymes from cutworm symbionts. Together, these results demonstrated a correlation between the composition and putative metabolic functionality of the gut microbiome and host diet, and suggested that this relationship could be exploited for the discovery of symbionts and biocatalysts useful for biorefinery applications.
The symbiotic gut microbiome of herbivorous insects is vital for their ability to utilize and specialize on plants with very different nutrient qualities. Moreover, the gut microbiome is a significant resource for the discovery of biocatalysts and microbes with applications to various biotechnologies. We compared the gut symbionts from three different insect species to examine whether there was a relationship between the diversity and metabolic capability of the symbionts and the diet of their hosts, with the goal of using such a relationship for the discovery of biocatalysts for biofuel applications. The study revealed that the metabolic capabilities of the insect gut symbionts correlated with insect adaptation to different food types and life histories at the levels of species, metabolic pathway, and individual gene. Moreover, we showed that the grasshopper cellulase and xylanase enzymes generally exhibited higher activities than those of cutworm, demonstrating differences in capabilities even at the protein level. Together, our findings confirmed our previous research and suggested that the grasshopper might be a good target for biocatalyst discovery due to their high gut cellulytic enzyme activities.
Marinomonas posidonica IVIA-Po-181T Lucas-Elío et al. 2011 belongs to the family Oceanospirillaceae within the phylum Proteobacteria. Different species of the genus Marinomonas can be readily isolated from the seagrass Posidonia oceanica. M. posidonica is among the most abundant species of the genus detected in the cultured microbiota of P. oceanica, suggesting a close relationship with this plant, which has a great ecological value in the Mediterranean Sea, covering an estimated surface of 38,000 Km2. Here we describe the genomic features of M. posidonica. The 3,899,940 bp long genome harbors 3,544 protein-coding genes and 107 RNA genes and is a part of the Genomic
Aerobic; Gram-negative; marine; plant-associated
Serratia plymuthica AS13 is a plant-associated Gammaproteobacteria, isolated from rapeseed roots. It is of special interest because of its ability to inhibit fungal pathogens of rapeseed and to promote plant growth. The complete genome of S. plymuthica AS13 consists of a 5,442,549 bp circular chromosome. The chromosome contains 4,951 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced as part of the project entitled “Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens” within the 2010 DOE-JGI Community Sequencing Program (CSP2010).
Gram-negative; non-sporulating; motile; plant-associated; chemoorganotrophic; Enterobacteriaceae
A protein fraction exhibiting 1-hydroxy-2-naphthoic acid (1-H2NA) dioxygenase activity was purified via ion exchange, hydrophobic interactions, and gel filtration chromatography from Arthrobacter phenanthrenivorans sp. nov. strain Sphe3 isolated from a Greek creosote-oil-polluted site. Matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) and tandem MS (MS-MS) analysis revealed that the amino acid sequences of oligopeptides of the major 45-kDa protein species, as analyzed by SDS-PAGE and silver staining, comprising 29% of the whole sequence, exhibited strong homology with 1-H2NA dioxygenase of Nocardioides sp. strain KP7. A BLAST search of the recently sequenced Sphe3 genome revealed two putative open reading frames, named diox1 and diox2, showing 90% nucleotide identity to each other and 85% identity at the amino acid level with the Nocardia sp. homologue. diox1 was found on an indigenous Sphe3 plasmid, whereas diox2 was located on the chromosome. Both genes were induced by the presence of phenanthrene used as a sole carbon and energy source, and as expected, both were subject to carbon catabolite repression. The relative RNA transcription level of the chromosomal (diox2) gene was significantly higher than that of its plasmid (diox1) homologue. Both diox1 and diox2 putative genes were PCR amplified, cloned, and overexpressed in Escherichia coli. Recombinant E. coli cells expressed 1-H2NA dioxygenase activity. Recombinant enzymes exhibited Michaelis-Menten kinetics with an apparent Km of 35 μM for Diox1 and 29 μM for Diox2, whereas they showed similar kinetic turnover characteristics with Kcat/Km values of 11 × 106 M−1 s−1 and 12 × 106 M−1 s−1, respectively. Occurrence of two diox1 and diox2 homologues in the Sphe3 genome implies that a replicative transposition event has contributed to the evolution of 1-H2NA dioxygenase in A. phenanthrenivorans.
Variability in the extent of the descriptions of data (‘metadata’) held in public repositories forces users to assess the quality of records individually, which rapidly becomes impractical. The scoring of records on the richness of their description provides a simple, objective proxy measure for quality that enables filtering that supports downstream analysis. Pivotally, such descriptions should spur on improvements. Here, we introduce such a measure - the ‘Metadata Coverage Index’ (MCI): the percentage of available fields actually filled in a record or description. MCI scores can be calculated across a database, for individual records or for their component parts (e.g., fields of interest). There are many potential uses for this simple metric: for example; to filter, rank or search for records; to assess the metadata availability of an ad hoc collection; to determine the frequency with which fields in a particular record type are filled, especially with respect to standards compliance; to assess the utility of specific tools and resources, and of data capture practice more generally; to prioritize records for further curation; to serve as performance metrics of funded projects; or to quantify the value added by curation. Here we demonstrate the utility of MCI scores using metadata from the Genomes Online Database (GOLD), including records compliant with the ‘Minimum Information about a Genome Sequence’ (MIGS) standard developed by the Genomic Standards Consortium. We discuss challenges and address the further application of MCI scores; to show improvements in annotation quality over time, to inform the work of standards bodies and repository providers on the usability and popularity of their products, and to assess and credit the work of curators. Such an index provides a step towards putting metadata capture practices and in the future, standards compliance, into a quantitative and objective framework.
Saccharomonospora marina Liu et al. 2010 is a member of the genus Saccharomonospora, in the family Pseudonocardiaceae that is poorly characterized at the genome level thus far. Members of the genus Saccharomonospora are of interest because they originate from diverse habitats, such as leaf litter, manure, compost, surface of peat, moist, over-heated grain, and ocean sediment, where they might play a role in the primary degradation of plant material by attacking hemicellulose. Organisms belonging to the genus are usually Gram-positive staining, non-acid fast, and classify among the actinomycetes. Here we describe the features of this organism, together with the complete genome sequence (permanent draft status), and annotation. The 5,965,593 bp long chromosome with its 5,727 protein-coding and 57 RNA genes was sequenced as part of the DOE funded Community Sequencing Program (CSP) 2010 at the Joint Genome Institute (JGI).
aerobic; chemoheterotrophic; Gram-positive; vegetative and aerial mycelia; spore-forming; non-motile; marine bacterium; Pseudonocardiaceae; CSP 2010
Dehalogenimonas lykanthroporepellens is the type species of the genus Dehalogenimonas, which belongs to a deeply branching lineage within the phylum Chloroflexi. This strictly anaerobic, mesophilic, non spore-forming, Gram-negative staining bacterium was first isolated from chlorinated solvent contaminated groundwater at a Superfund site located near Baton Rouge, Louisiana, USA. D. lykanthroporepellens was of interest for genome sequencing for two reasons: (a) an unusual ability to couple growth with reductive dechlorination of environmentally important polychlorinated aliphatic alkanes and (b) a phylogenetic position that is distant from previously sequenced bacteria. The 1,686,510 bp circular chromosome of strain BL-DC-9T contains 1,720 predicted protein coding genes, 47 tRNA genes, a single large subunit rRNA (23S-5S) locus, and a single, orphan, small subunit rRNA (16S) locus.
reductive dechlorination; groundwater; strictly anaerobic; hydrogen utilization; contamination; Chloroflexi
Saccharomonospora azurea Runmao et al. 1987 is a member of the genus Saccharomonospora, which is in the family Pseudonocardiaceae and thus far poorly characterized genomically. Members of the genus Saccharomonospora are of interest because they originate from diverse habitats, such as leaf litter, manure, compost, the surface of peat, and moist and over-heated grain, and may play a role in the primary degradation of plant material by attacking hemicellulose. Next to S. viridis, S. azurea is only the second member in the genus Saccharomonospora for which a completely sequenced type strain genome will be published. Here we describe the features of this organism, together with the complete genome sequence with project status ‘Improved high quality draft’, and the annotation. The 4,763,832 bp long chromosome with its 4,472 protein-coding and 58 RNA genes was sequenced as part of the DOE funded Community Sequencing Program (CSP) 2010 at the Joint Genome Institute (JGI).
aerobic; chemoheterotrophic; Gram-positive; vegetative and aerial mycelia; spore-forming; non-motile; soil bacterium; Pseudonocardiaceae; CSP 2010
A plant-associated member of the family Enterobacteriaceae, Serratia plymuthica strain AS12 was isolated from rapeseed roots. It is of scientific interest because it promotes plant growth and inhibits plant pathogens. The genome of S. plymuthica AS12 comprises a 5,443,009 bp long circular chromosome, which consists of 4,952 protein-coding genes, 87 tRNA genes and 7 rRNA operons. This genome was sequenced within the 2010 DOE-JGI Community Sequencing Program (CSP2010) as part of the project entitled “Genomics of four rapeseed plant growth promoting bacteria with antagonistic effect on plant pathogens”.
Facultative anaerobe; gram-negative; motile; non-sporulating; mesophilic; chemoorganotrophic; agriculture; Enterobacteriaceae; CSP 2010
Ruminococcus albus 7 is a highly cellulolytic ruminal bacterium that is a member of the phylum Firmicutes. Here, we describe the complete genome of this microbe. This genome will be useful for rumen microbiology and cellulosome biology and in biofuel production, as one of its major fermentation products is ethanol.
Marinomonas mediterranea MMB-1T Solano & Sanchez-Amat 1999 belongs to the family Oceanospirillaceae within the phylum Proteobacteria. This species is of interest because it is the only species described in the genus Marinomonas to date that can synthesize melanin pigments, which is mediated by the activity of a tyrosinase. M. mediterranea expresses other oxidases of biotechnological interest, such as a multicopper oxidase with laccase activity and a novel L-lysine-epsilon-oxidase. The 4,684,316 bp long genome harbors 4,228 protein-coding genes and 98 RNA genes and is a part of the Genomic
Gram-negative; marine; melanin; laccase; tyrosinase; lysine oxidase
Zymomonas mobilis ATCC 10988 is the type strain of the Z. mobilis subsp. mobilis taxon, members of which are some of the most rigorous ethanol-producing bacteria. Isolated from Agave cactus fermentations in Mexico, ATCC 10988 is one of the first Z. mobilis strains to be described and studied. Its robustness in sucrose-substrate fermentations, physiological characteristics, large number of plasmids, and overall genomic plasticity render this strain important to the study of the species. Here we report the finishing and annotation of the ATCC 10988 chromosomal and plasmid genome.
Zymomonas mobilis is an alphaproteobacterium studied for bioethanol production. Different strains of this organism have been hitherto sequenced; they all belong to the Z. mobilis subsp. mobilis taxon. Here we report the finished and annotated genome sequence of strain ATCC 29192, a cider-spoiling agent isolated in the United Kingdom. ATCC 29192 is the lectotype of the second-best-characterized subspecies of Z. mobilis, Z. mobilis subsp. pomaceae. The nucleotide sequence of ATCC 29192 deviates from that of Z. mobilis subsp. mobilis representatives, which justifies its distinct taxonomic positioning and proves particularly useful for comparative and functional genomic analyses.
Terephthalate (TA) is one of the top 50 chemicals produced worldwide. Its production results in a TA-containing wastewater that is treated by anaerobic processes through a poorly understood methanogenic syntrophy. Using metagenomics, we characterized the methanogenic consortium inside a hyper-mesophilic (that is, between mesophilic and thermophilic), TA-degrading bioreactor. We identified genes belonging to dominant Pelotomaculum species presumably involved in TA degradation through decarboxylation, dearomatization, and modified β-oxidation to H2/CO2 and acetate. These intermediates are converted to CH4/CO2 by three novel hyper-mesophilic methanogens. Additional secondary syntrophic interactions were predicted in Thermotogae, Syntrophus and candidate phyla OP5 and WWE1 populations. The OP5 encodes genes capable of anaerobic autotrophic butyrate production and Thermotogae, Syntrophus and WWE1 have the genetic potential to oxidize butyrate to CO2/H2 and acetate. These observations suggest that the TA-degrading consortium consists of additional syntrophic interactions beyond the standard H2-producing syntroph–methanogen partnership that may serve to improve community stability.
metagenomics; methanogenesis; syntroph; microbial diversity; carbon cycling
Thioalkalivibrio sp. K90mix is an obligately chemolithoautotrophic, natronophilic sulfur-oxidizing bacterium (SOxB) belonging to the family Ectothiorhodospiraceae within the Gammaproteobacteria. The strain was isolated from a mixture of sediment samples obtained from different soda lakes located in the Kulunda Steppe (Altai, Russia) based on its extreme potassium carbonate tolerance as an enrichment method. Here we report the complete genome sequence of strain K90mix and its annotation. The genome was sequenced within the Joint Genome Institute Community Sequencing Program, because of its relevance to the sustainable removal of sulfide from wastewater and gas streams.
natronophilic; sulfide; thiosulfate; sulfur-oxidizing bacteria; soda lakes
The Integrated Microbial Genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG integrates publicly available draft and complete genomes from all three domains of life with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. IMG's data content and analytical capabilities have been continuously extended through regular updates since its first release in March 2005. IMG is available at http://img.jgi.doe.gov. Companion IMG systems provide support for expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er), teaching courses and training in microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu) and analysis of genomes related to the Human Microbiome Project (IMG/HMP: http://www.hmpdacc-resources.org/img_hmp).