The Enterobacteriaceae bacterium strain FGI 57 was isolated from a fungus garden of the leaf-cutter ant Atta colombica. Analysis of its single 4.76-Mbp chromosome will shed light on community dynamics and plant biomass degradation in ant fungus gardens.
Effective comparative analysis of microbial genomes requires a consistent and complete view of biological data. Consistency regards the biological coherence of annotations, while completeness regards the extent and coverage of functional characterization for genomes. We have developed tools that allow scientists to assess and improve the consistency and completeness of microbial genome annotations in the context of the Integrated Microbial Genomes (IMG) family of systems. All publicly available microbial genomes are characterized in IMG using different functional annotation and pathway resources, thus providing a comprehensive framework for identifying and resolving annotation discrepancies. A rule based system for predicting phenotypes in IMG provides a powerful mechanism for validating functional annotations, whereby the phenotypic traits of an organism are inferred based on the presence of certain metabolic reactions and pathways and compared to experimentally observed phenotypes. The IMG family of systems are available at http://img.jgi.doe.gov/.
The strains Thermus oshimai JL-2 and Thermus thermophilus JL-18 each have a circular chromosome, 2.07 Mb and 1.9 Mb in size, respectively, and each has two plasmids ranging from 0.27 Mb to 57.2 kb. The megaplasmid of each strain contains a gene cluster for the reduction of nitrate to nitrous oxide, consistent with their incomplete denitrification phenotypes.
Niabella soli Weon et al. 2008 is a member of the Chitinophagaceae, a family within the class Sphingobacteriia that is poorly characterized at the genome level, thus far. N. soli strain JS13-8T is of interest for its ability to produce a variety of glycosyl hydrolases. The genome of N. soli strain JS13-8T is only the second genome sequence of a type strain from the family Chitinophagaceae to be published, and the first one from the genus Niabella. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,697,343 bp long chromosome with its 3,931 protein-coding and 49 RNA genes is a part of the Genomic
aerobic; non-motile; Gram-negative; mesophilic; chemoorganotrophic; glycosyl hydrolases; soil; Chitinophagaceae; GEBA
Gillisia limnaea Van Trappen et al. 2004 is the type species of the genus Gillisia, which is a member of the well characterized family Flavobacteriaceae. The genome of G. limnea R-8282T is the first sequenced genome (permanent draft) from a type strain of the genus Gillisia. Here we describe the features of this organism, together with the permanent-draft genome sequence and annotation. The 3,966,857 bp long chromosome (two scaffolds) with its 3,569 protein-coding and 51 RNA genes is a part of the Genomic
Encyclopedia of Bacteria and Archaea project.
aerobic; motile; rod-shaped; moderately halotolerant; psychrophilic; chemoheterotrophic; proteorhodopsin; microbial mat; yellow-pigmented; Flavobacteriaceae; GEBA
Owenweeksia hongkongensis Lau et al. 2005 is the sole member of the monospecific genus Owenweeksia in the family Cryomorphaceae, a poorly characterized family at the genome level thus far. This family comprises seven genera within the class Flavobacteria. Family members are known to be psychrotolerant, rod-shaped and orange pigmented (β-carotene), typical for Flavobacteria. For growth, seawater and complex organic nutrients are necessary. The genome of O. hongkongensis UST20020801T is only the second genome of a member of the family Cryomorphaceae whose sequence has been deciphered. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,000,057 bp long chromosome with its 3,518 protein-coding and 45 RNA genes is a part of the Genomic
aerobic; motile; rod-shaped; mesophilic; non-fermentative; Gram-negative; orange-pigmented sea water; Bacteroidetes; Flavobacteria; Cryomorphaceae; GEBA
Sulfobacillus acidophilus Norris et al. 1996 is a member of the genus Sulfobacillus which comprises five species of the order Clostridiales. Sulfobacillus species are of interest for comparison to other sulfur and iron oxidizers and also have biomining applications. This is the first completed genome sequence of a type strain of the genus Sulfobacillus, and the second published genome of a member of the species S. acidophilus. The genome, which consists of one chromosome and one plasmid with a total size of 3,557,831 bp harbors 3,626 protein-coding and 69 RNA genes, and is a part of the Genomic
aerobic; motile; Gram-positive; acidophilic; moderately thermophilic; sulfide- and iron-oxidizing; biomining; autotrophic; mixotrophic; soil; insertis sedis; Clostridiales; GEBA
The Integrated Microbial Genomes and Metagenomes (IMG/M) resource is a data management system that supports the analysis of sequence data from microbial communities in the integrated context of all publicly available draft and complete genomes from the three domains of life as well as a large number of plasmids and viruses. IMG/M currently contains thousands of genomes and metagenome samples with billions of genes. IMG/M-HMP is an IMG/M data mart serving the US National Institutes of Health (NIH) Human Microbiome Project (HMP), focussed on HMP generated metagenome datasets, and is one of the central resources provided from the HMP Data Analysis and Coordination Center (DACC). IMG/M-HMP is available at http://www.hmpdacc-resources.org/imgm_hmp/.
Runella slithyformis Larkin and Williams 1978 is the type species of the genus Runella, which belongs to the Cytophagaceae, a family that was only recently classified to the order Cytophagales in the class Cytophagia. The species is of interest because it is able to grow at temperatures as low as 4°C. This is the first completed genome sequence of a member of the genus Runella and the sixth sequence from the family Cytophagaceae. The 6,919,729 bp long genome consists of a 6.6 Mbp circular genome and five circular plasmids of 38.8 to 107.0 kbp length, harboring a total of 5,974 protein-coding and 51 RNA genes and is a part of the Genomic
strictly aerobic; non-motile; Gram-negative; psychrotolerant; chemoorganotrophic; Cytophagaceae; Cytophagia; GEBA
Thermodesulfatator indicus Moussard et al. 2004 is a member of the Thermodesulfobacteriaceae, a family in the phylum Thermodesulfobacteria that is currently poorly characterized at the genome level. Members of this phylum are of interest because they represent a distinct, deep-branching, Gram-negative lineage. T. indicus is an anaerobic, thermophilic, chemolithoautotrophic sulfate reducer isolated from a deep-sea hydrothermal vent. Here we describe the features of this organism, together with the complete genome sequence, and annotation. The 2,322,224 bp long chromosome with its 2,233 protein-coding and 58 RNA genes is a part of the Genomic
strictly anaerobic; motile; Gram-negative; thermophilic; sulfate-reducing; chemolithoautotrophic; black smoker; Thermodesulfobacteria; Thermodesulfobacteriaceae; GEBA
Thermovirga lienii Dahle and Birkeland 2006 is a member of the genus Thermovirga in the genomically moderately well characterized phylum 'Synergistetes'. Members of this relatively recently proposed phylum ‘Synergistetes’ are of interest because of their isolated phylogenetic position and their diverse habitats, e.g. from humans to oil wells. The genome of T. lienii Cas60314T is the fifth genome sequence (third completed) from this phylum to be published. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 1,999,646 bp long genome (including one plasmid) with its 1,914 protein-coding and 59 RNA genes is a part of the Genomic
anaerobic; chemoorganotrophic; Gram-negative; motile; thermophilic; marine oil well; Synergistaceae; GEBA
Holophaga foetida Liesack et al. 1995 is a member of the phylum Acidobacteria and is of interest for its ability to anaerobically degrade aromatic compounds and for its production of volatile sulfur compounds through a unique pathway. The genome of H. foetida strain TMBS4T is the first to be sequenced for a representative of the class Holophagae. Here we describe the features of this organism, together with the complete genome sequence (improved high quality draft), and annotation. The 4,127,237 bp long chromosome with its 3,615 protein-coding and 57 RNA genes is a part of the Genomic
anaerobic; motile; Gram-negative; mesophilic; chemoorganotrophic; sulfide-methylation; fresh water mud; Acidobacteria; Holophagaceae; GEBA
Muricauda ruestringensis Bruns et al. 2001 is the type species of the genus Muricauda, which belongs to the family Flavobacteriaceae in the phylum Bacteroidetes. The species is of interest because of its isolated position in the genomically unexplored genus Muricauda, which is located in a part of the tree of life containing not many organisms with sequenced genomes. The genome, which consists of a circular chromosome of 3,842,422 bp length with a total of 3,478 protein-coding and 47 RNA genes, is a part of the Genomic
facultatively anaerobic; non-motile; Gram-negative; mesophilic; marine; chemoheterotrophic; Flavobacteriaceae; GEBA
Saprospira grandis Gross 1911 is a member of the Saprospiraceae, a family in the class ‘Sphingobacteria’ that remains poorly characterized at the genomic level. The species is known for preying on other marine bacteria via ‘ixotrophy’. S. grandis strain Sa g1 was isolated from decaying crab carapace in France and was selected for genome sequencing because of its isolated location in the tree of life. Only one type strain genome has been published so far from the Saprospiraceae, while the sequence of strain Sa g1 represents the second genome to be published from a non-type strain of S. grandis. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 4,495,250 bp long Improved-High-Quality draft of the genome with its 3,536 protein-coding and 62 RNA genes is a part of the Genomic
strictly aerobic; gliding; Gram-negative; mesophilic; chemoorganotrophic; marine; ixotrophy; Saprospiraceae; GEBA
Spirochaeta coccoides Dröge et al. 2006 is a member of the genus Spirochaeta Ehrenberg 1835, one of the oldest named genera within the Bacteria. S. coccoides is an obligately anaerobic, Gram-negative, non-motile, spherical bacterium that was isolated from the hindgut contents of the termite Neotermes castaneus. The species is of interest because it may play an important role in the digestion of breakdown products from cellulose and hemicellulose in the termite gut. Here we provide a taxonomic re-evaluation for strain SPN1T, and based on physiological and genomic characteristics, we propose its reclassification as a novel species in the genus Sphaerochaeta, a recently published sister group of the Spirochaeta. The 2,227,296 bp long genome of strain SPN1T with its 1,866 protein-coding and 58 RNA genes is a part of the Genomic
obligately anaerobic; non-motile; termite hindgut; Gram-negative; di- and oligosaccharide-degrading; mesophilic; chemoorganotrophic; Spirochaetaceae; Sphaerochaeta; GEBA
Deinococcus proteolyticus (ex Kobatake et al. 1973) Brook and Murray 1981 is one of currently 47 species in the genus Deinococcus within the family Deinococcaceae. Strain MRPT was isolated from feces of Lama glama and possesses extreme radiation resistance, a trait is shares with various other species of the genus Deinococcus, with D. proteolyticus being resistant up to 1.5 Mrad of gamma radiation. Strain MRPT is of further interest for its carotenoid pigment. The genome presented here is only the fifth completed genome sequence of a member of the genus Deinococcus (and the forth type strain) to be published, and will hopefully contribute to a better understanding of how members of this genus adapted to high gamma- or UV ionizing-radiation. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,886,836 bp long genome with its four large plasmids of lengths 97 kbp, 132 kbp, 196 kbp and 315 kbp harbors 2,741 protein-coding and 58 RNA genes and is a part of the Genomic
strictly aerobic; non-motile; chemoorganotrophic; proteolytic; radioresistant; mesophile; carotenoid pigments; tetrad-forming cocci; Gram-positive; Deinococcaceae; GEBA
Sulfuricurvum kujiense Kodama and Watanabe 2004 is the type species of the monotypic genus Sulfuricurvum, which belongs to the family Helicobacteraceae in the class Epsilonproteobacteria. The species is of interest because it is frequently found in crude oil and oil sands where it utilizes various reduced sulfur compounds such as elemental sulfur, sulfide and thiosulfate as electron donors. Members of the species do not utilize sugars, organic acids or hydrocarbons as carbon and energy sources. This genome sequence represents the type strain of the only species in the genus Sulfuricurvum. The genome, which consists of a circular chromosome of 2,574,824 bp length and four plasmids of 118,585 bp, 71,513 bp, 51,014 bp, and 3,421 bp length, respectively, harboring a total of 2,879 protein-coding and 61 RNA genes and is a part of the Genomic
facultatively anaerobic; microaerobic; motile; Gram-negative; sulfur-oxidizing; mesophilic; chemolithoautotrophic; Helicobacteracea; GEBA
Marinithermus hydrothermalis Sako et al. 2003 is the type species of the monotypic genus Marinithermus. M. hydrothermalis T1T was the first isolate within the phylum “Thermus-Deinococcus” to exhibit optimal growth under a salinity equivalent to that of sea water and to have an absolute requirement for NaCl for growth. M. hydrothermalis T1T is of interest because it may provide a new insight into the ecological significance of the aerobic, thermophilic decomposers in the circulation of organic compounds in deep-sea hydrothermal vent ecosystems. This is the first completed genome sequence of a member of the genus Marinithermus and the seventh sequence from the family Thermaceae. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,269,167 bp long genome with its 2,251 protein-coding and 59 RNA genes is a part of the Genomic
strictly aerobic; non-motile; thermophilic; neutrophilic heterotroph; Gram-negative; hydrothermal vent; Thermaceae; GEBA
Thiothrix nivea (Rabenhorst 1865) Winogradsky 1888 (Approved Lists 1980) emend. Larkin and Shinabarger 1983 is the type species of the genus Thiothrix in the family Thiotrichaceae. The species is of interest not only because of its isolated location in the yet to be genomically characterized region of the tree of life, but also because of its life-style with gliding gonidia, the multilayer sheath, rosettes, and the embedded sulfur granules. Strain JP2T is the neotype strain of the species which was first observed by Rabenhorst in 1865 and later reclassified by Winogradsky in 1888 into the then novel genus Thiothrix. This is the first completed (improved-high-quality-draft) genome sequence to be published of a member of the family Thiotrichaceae. The genome in its current assembly consists of 15 contigs in four scaffolds with a total of 4,691,711 bp bearing 4,542 protein-coding and 52 RNA genes and is a part of the Genomic
strictly aerobic; gliding motility; Gram-negative; mesophile; sheath; filaments; sulfur granules; Thiotrichaceae; GEBA
Desulfurobacterium thermolithotrophum L'Haridon et al. 1998 is the type species of the genus Desulfurobacterium which belongs to the family Desulfurobacteriaceae. The species is of interest because it represents the first thermophilic bacterium that can act as a primary producer in the temperature range of 45-75 °C (optimum 70°C) and is incapable of growing under microaerophilic conditions. Strain BSAT preferentially synthesizes high-melting-point fatty acids (C18 and C20) which is hypothesized to be a strategy to ensure the functionality of the membrane at high growth temperatures. This is the second completed genome sequence of a member of the family Desulfurobacteriaceae and the first sequence from the genus Desulfurobacterium. The 1,541,968 bp long genome harbors 1,543 protein-coding and 51 RNA genes and is a part of the Genomic
anaerobic; thermophilic; neutrophilic; obligately chemolithoautotrophic; Gram-negative; marine; sulfur-reducing; Desulfurobacteriaceae; GEBA
One of the most important aspects of genomic analysis is the prediction of which pathways, both metabolic and non-metabolic, are present in an organism. In IMG, this is carried out by the assignment of IMG terms, which are organized into IMG pathways. Based on manual and automatic assignment of IMG terms, the presence or absence of IMG pathways is automatically inferred. The three categories of pathway assertion are asserted (likely present), not asserted (likely absent), and unknown. In the unknown category, at least one term necessary for the pathway is missing, but an ortholog in another organism has the corresponding term assigned to it. Automatic pathway inference is an important initial step in genome analysis.
The Integrated Microbial Genomes (IMG) system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context. IMG integrates publicly available draft and complete genomes from all three domains of life with a large number of plasmids and viruses. IMG provides tools and viewers for analyzing and reviewing the annotations of genes and genomes in a comparative context. IMG's data content and analytical capabilities have been continuously extended through regular updates since its first release in March 2005. IMG is available at http://img.jgi.doe.gov. Companion IMG systems provide support for expert review of genome annotations (IMG/ER: http://img.jgi.doe.gov/er), teaching courses and training in microbial genome analysis (IMG/EDU: http://img.jgi.doe.gov/edu) and analysis of genomes related to the Human Microbiome Project (IMG/HMP: http://www.hmpdacc-resources.org/img_hmp).
The Genomes OnLine Database (GOLD, http://www.genomesonline.org/) is a comprehensive resource for centralized monitoring of genome and metagenome projects worldwide. Both complete and ongoing projects, along with their associated metadata, can be accessed in GOLD through precomputed tables and a search page. As of September 2011, GOLD, now on version 4.0, contains information for 11 472 sequencing projects, of which 2907 have been completed and their sequence data has been deposited in a public repository. Out of these complete projects, 1918 are finished and 989 are permanent drafts. Moreover, GOLD contains information for 340 metagenome studies associated with 1927 metagenome samples. GOLD continues to expand, moving toward the goal of providing the most comprehensive repository of metadata information related to the projects and their organisms/environments in accordance with the Minimum Information about any (x) Sequence specification and beyond.
The integrated microbial genomes and metagenomes (IMG/M) system provides support for comparative analysis of microbial community aggregate genomes (metagenomes) in a comprehensive integrated context. IMG/M integrates metagenome data sets with isolate microbial genomes from the IMG system. IMG/M's data content and analytical capabilities have been extended through regular updates since its first release in 2007. IMG/M is available at http://img.jgi.doe.gov/m. A companion IMG/M systems provide support for annotation and expert review of unpublished metagenomic data sets (IMG/M ER: http://img.jgi.doe.gov/mer).
Ktedonobacter racemifer corrig. Cavaletti et al. 2007 is the type species of the genus Ktedonobacter, which in turn is the type genus of the family Ktedonobacteraceae, the type family of the order Ktedonobacterales within the class Ktedonobacteria in the phylum ‘Chloroflexi’. Although K. racemifer shares some morphological features with the actinobacteria, it is of special interest because it was the first cultivated representative of a deep branching unclassified lineage of otherwise uncultivated environmental phylotypes tentatively located within the phylum ‘Chloroflexi’. The aerobic, filamentous, non-motile, spore-forming Gram-positive heterotroph was isolated from soil in Italy. The 13,661,586 bp long non-contiguous finished genome consists of ten contigs and is the first reported genome sequence from a member of the class Ktedonobacteria. With its 11,453 protein-coding and 87 RNA genes, it is the largest prokaryotic genome reported so far. It comprises a large number of over-represented COGs, particularly genes associated with transposons, causing the genetic redundancy within the genome being considerably larger than expected by chance. This work is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
aerobic; heterotrophic; filamentous; non-motile; Gram-positive; moderately acidophilic; sporulating; transposon; broken-stick distribution; entropy; Ktedonobacteraceae; Chloroflexi; GEBA