The benefits of using transgenic switchgrass with decreased levels of caffeic acid 3-O-methyltransferase (COMT) as biomass feedstock have been clearly demonstrated. However, its effect on the soil microbial community has not been assessed. Here we report metagenomic and metatranscriptomic analyses of root-associated soil from COMT switchgrass compared with nontransgenic counterparts.
Bacteria belonging to the phylum Gemmatimonadetes are found in a wide variety of environments and are particularly abundant in soils. Here, we present the complete genome sequence and methylation pattern of the newly described Gemmatirosa kalamazoonensis type strain.
The thermophilic anaerobe Clostridium thermocellum is a candidate consolidated bioprocessing (CBP) biocatalyst for cellulosic ethanol production. The aim of this study was to investigate C. thermocellum genes required to ferment biomass substrates and to conduct a robust comparison of DNA microarray and RNA sequencing (RNA-seq) analytical platforms.
C. thermocellum ATCC 27405 fermentations were conducted with a 5 g/L solid substrate loading of either pretreated switchgrass or Populus. Quantitative saccharification and inductively coupled plasma emission spectroscopy (ICP-ES) for elemental analysis revealed composition differences between biomass substrates, which may have influenced growth and transcriptomic profiles. High quality RNA was prepared for C. thermocellum grown on solid substrates and transcriptome profiles were obtained for two time points during active growth (12 hours and 37 hours postinoculation). A comparison of two transcriptomic analytical techniques, microarray and RNA-seq, was performed and the data analyzed for statistical significance. Large expression differences for cellulosomal genes were not observed. We updated gene predictions for the strain and a small novel gene, Cthe_3383, with a putative AgrD peptide quorum sensing function was among the most highly expressed genes. RNA-seq data also supported different small regulatory RNA predictions over others. The DNA microarray gave a greater number (2,351) of significant genes relative to RNA-seq (280 genes when normalized by the kernel density mean of M component (KDMM) method) in an analysis of variance (ANOVA) testing method with a 5% false discovery rate (FDR). When a 2-fold difference in expression threshold was applied, 73 genes were significantly differentially expressed in common between the two techniques. Sulfate and phosphate uptake/utilization genes, along with genes for a putative efflux pump system were some of the most differentially regulated transcripts when profiles for C. thermocellum grown on either pretreated switchgrass or Populus were compared.
Our results suggest that a high degree of agreement in differential gene expression measurements between transcriptomic platforms is possible, but choosing an appropriate normalization regime is essential.
Genome; Reannotation; Biomass; Elemental composition; RNA-seq; Microarray; Phosphate; Normalization; Transcriptomics
Granulicella mallensis MP5ACTX8T is a novel species of the genus Granulicella in subdivision 1of Acidobacteria. G. mallensis is of ecological interest being a member of the dominant soil bacterial community active at low temperatures and nutrient limiting conditions in Arctic alpine tundra. G. mallensis is a cold-adapted acidophile and a versatile heterotroph that hydrolyzes a suite of sugars and complex polysaccharides. Genome analysis revealed metabolic versatility with genes involved in metabolism and transport of carbohydrates. These include gene modules encoding the carbohydrate-active enzyme (CAZyme) family involved in breakdown, utilization and biosynthesis of diverse structural and storage polysaccharides including plant based carbon polymers. The genome of Granulicella mallensis MP5ACTX8T consists of a single replicon of 6,237,577 base pairs (bp) with 4,907 protein-coding genes and 53 RNA genes.
cold adapted; acidophile; tundra soil; Acidobacteria
The genomes of the Betaproteobacteria Alicycliphilus denitrificans strains BC and K601T have been sequenced to get insight into the physiology of the two strains. Strain BC degrades benzene with chlorate as electron acceptor. The cyclohexanol-degrading denitrifying strain K601T is not able to use chlorate as electron acceptor, while strain BC cannot degrade cyclohexanol. The 16S rRNA sequences of strains BC and K601T are identical and the fatty acid methyl ester patterns of the strains are similar. Basic Local Alignment Search Tool (BLAST) analysis of predicted open reading frames of both strains showed most hits with Acidovorax sp. JS42, a bacterium that degrades nitro-aromatics. The genomes include strain-specific plasmids (pAlide201 in strain K601T and pAlide01 and pAlide02 in strain BC). Key genes of chlorate reduction in strain BC were located on a 120 kb megaplasmid (pAlide01), which was absent in strain K601T. Genes involved in cyclohexanol degradation were only found in strain K601T. Benzene and toluene are degraded via oxygenase-mediated pathways in both strains. Genes involved in the meta-cleavage pathway of catechol are present in the genomes of both strains. Strain BC also contains all genes of the ortho-cleavage pathway. The large number of mono- and dioxygenase genes in the genomes suggests that the two strains have a broader substrate range than known thus far.
Extremely thermophilic bacteria of the genus Caldicellulosiruptor utilize carbohydrate components of plant cell walls, including cellulose and hemicellulose, facilitated by a diverse set of glycoside hydrolases (GHs). From a biofuel perspective, this capability is crucial for deconstruction of plant biomass into fermentable sugars. While all species from the genus grow on xylan and acid-pretreated switchgrass, growth on crystalline cellulose is variable. The basis for this variability was examined using microbiological, genomic, and proteomic analyses of eight globally diverse Caldicellulosiruptor species. The open Caldicellulosiruptor pangenome (4,009 open reading frames [ORFs]) encodes 106 GHs, representing 43 GH families, but only 26 GHs from 17 families are included in the core (noncellulosic) genome (1,543 ORFs). Differentiating the strongly cellulolytic Caldicellulosiruptor species from the others is a specific genomic locus that encodes multidomain cellulases from GH families 9 and 48, which are associated with cellulose-binding modules. This locus also encodes a novel adhesin associated with type IV pili, which was identified in the exoproteome bound to crystalline cellulose. Taking into account the core genomes, pangenomes, and individual genomes, the ancestral Caldicellulosiruptor was likely cellulolytic and evolved, in some cases, into species that lost the ability to degrade crystalline cellulose while maintaining the capacity to hydrolyze amorphous cellulose and hemicellulose.
Toxic cyanobacterial blooms have persisted in freshwater systems around the world for centuries and appear to be globally increasing in frequency and severity. Toxins produced by bloom-associated cyanobacteria can have drastic impacts on the ecosystem and surrounding communities, and bloom biomass can disrupt aquatic food webs and act as a driver for hypoxia. Little is currently known regarding the genomic content of the Microcystis strains that form blooms or the companion heterotrophic community associated with bloom events. To address these issues, we examined the bloom-associated microbial communities in single samples from Lake Erie (North America), Lake Tai (Taihu, China), and Grand Lakes St. Marys (OH, USA) using comparative metagenomics. Together the Cyanobacteria and Proteobacteria comprised >90% of each bloom bacterial community sample, although the dominant phylum varied between systems. Relative to the existing Microcystis aeruginosa NIES 843 genome, sequences from Lake Erie and Taihu revealed a number of metagenomic islands that were absent in the environmental samples. Moreover, despite variation in the phylogenetic assignments of bloom-associated organisms, the functional potential of bloom members remained relatively constant between systems. This pattern was particularly noticeable in the genomic contribution of nitrogen assimilation genes. In Taihu, the genetic elements associated with the assimilation and metabolism of nitrogen were predominantly associated with Proteobacteria, while these functions in the North American lakes were primarily contributed to by the Cyanobacteria. Our observations build on an emerging body of metagenomic surveys describing the functional potential of microbial communities as more highly conserved than that of their phylogenetic makeup within natural systems.
Paenibacillus sp.Y412MC10 was one of a number of organisms isolated from Obsidian Hot Spring, Yellowstone National Park, Montana, USA under permit from the National Park Service. The isolate was initially classified as a Geobacillus sp. Y412MC10 based on its isolation conditions and similarity to other organisms isolated from hot springs at Yellowstone National Park. Comparison of 16 S rRNA sequences within the Bacillales indicated that Geobacillus sp.Y412MC10 clustered with Paenibacillus species, and the organism was most closely related to Paenibacillus lautus. Lucigen Corp. prepared genomic DNA and the genome was sequenced, assembled, and annotated by the DOE Joint Genome Institute. The genome sequence was deposited at the NCBI in October 2009 (NC_013406). The genome of Paenibacillus sp. Y412MC10 consists of one circular chromosome of 7,121,665 bp with an average G+C content of 51.2%. Comparison to other Paenibacillus species shows the organism lacks nitrogen fixation, antibiotic production and social interaction genes reported in other paenibacilli. The Y412MC10 genome shows a high level of synteny and homology to the draft sequence of Paenibacillus sp. HGF5, an organism from the Human Microbiome Project (HMP) Reference Genomes. This, combined with genomic CAZyme analysis, suggests an intestinal, rather than environmental origin for Y412MC10.
Geobacillus sp. Y412MC10; Paenibacillus sp. Y412MC10; Obsidian Hot Spring
Ruminococcus albus 7 is a highly cellulolytic ruminal bacterium that is a member of the phylum Firmicutes. Here, we describe the complete genome of this microbe. This genome will be useful for rumen microbiology and cellulosome biology and in biofuel production, as one of its major fermentation products is ethanol.
Alicycliphilus denitrificans strain BC and A. denitrificans strain K601T degrade cyclic hydrocarbons. These strains have been isolated from a mixture of wastewater treatment plant material and benzene-polluted soil and from a wastewater treatment plant, respectively, suggesting their role in bioremediation of soil and water. Although the strains are phylogenetically closely related, there are some clear physiological differences. The hydrocarbon cyclohexanol, for example, can be degraded by strain K601T but not by strain BC. Furthermore, both strains can use nitrate and oxygen as an electron acceptor, but only strain BC can use chlorate as electron acceptor. To better understand the nitrate and chlorate reduction mechanisms coupled to the oxidation of cyclic compounds, the genomes of A. denitrificans strains BC and K601T were sequenced. Here, we report the complete genome sequences of A. denitrificans strains BC and K601T.
Desulfovibrio alaskensis G20 (formerly Desulfovibrio desulfuricans G20) is a Gram-negative mesophilic sulfate-reducing bacterium (SRB), known to corrode ferrous metals and to reduce toxic radionuclides and metals such as uranium and chromium to sparingly soluble and less toxic forms. We present the 3.7-Mb genome sequence to provide insights into its physiology.
Halanaerobium hydrogenoformans is an alkaliphilic bacterium capable of biohydrogen production at pH 11 and 7% (wt/vol) salt. We present the 2.6-Mb genome sequence to provide insights into its physiology and potential for bioenergy applications.
Here we present the genome of strain Exiguobacterium sp. AT1b, a thermophilic member of the genus Exiguobacterium whose representatives were isolated from various environments along a thermal and physicochemical gradient. This genome was sequenced to be a comparative resource for the study of thermal adaptation with a psychroactive representative of the genus, Exiguobacterium sibiricum strain 255-15, that was previously sequenced by the U.S. Department of Energy's (DOE's) Joint Genome Institute (JGI) (http://genome.ornl.gov/microbial/exig/).
Cellulosilyticum lentocellum DSM 5427 is an anaerobic, endospore-forming member of the Firmicutes. We describe the complete genome sequence of this cellulose-degrading bacterium, which was originally isolated from estuarine sediment of a river that received both domestic and paper mill waste. Comparative genomics of cellulolytic clostridia will provide insight into factors that influence degradation rates.
Desulfovibrio desulfuricans strain ND132 is an anaerobic sulfate-reducing bacterium (SRB) capable of producing methylmercury (MeHg), a potent human neurotoxin. The mechanism of methylation by this and other organisms is unknown. We present the 3.8-Mb genome sequence to provide further insight into microbial mercury methylation.
The genus Caldicellulosiruptor contains the most thermophilic, plant biomass-degrading bacteria isolated to date. Previously, genome sequences from three cellulolytic members of this genus were reported (C. saccharolyticus, C. bescii, and C. obsidiansis). To further explore the physiological and biochemical basis for polysaccharide degradation within this genus, five additional genomes were sequenced: C. hydrothermalis, C. kristjanssonii, C. kronotskyensis, C. lactoaceticus, and C. owensensis. Taken together, the seven completed and one draft-phase Caldicellulosiruptor genomes suggest that, while central metabolism is highly conserved, significant differences in glycoside hydrolase inventories and numbers of carbohydrate transporters exist, a finding which likely relates to variability observed in plant biomass degradation capacity.
Nocardioides sp. strain JS614 grows on ethene and vinyl chloride (VC) as sole carbon and energy sources and is of interest for bioremediation and biocatalysis. Sequencing of the complete genome of JS614 provides insight into the genetic basis of alkene oxidation, supports ongoing research into the physiology and biochemistry of growth on ethene and VC, and provides biomarkers to facilitate detection of VC/ethene oxidizers in the environment. This is the first genome sequence from the genus Nocardioides and the first genome of a VC/ethene-oxidizing bacterium.
Chloroflexus aurantiacus is a thermophilic filamentous anoxygenic phototrophic (FAP) bacterium, and can grow phototrophically under anaerobic conditions or chemotrophically under aerobic and dark conditions. According to 16S rRNA analysis, Chloroflexi species are the earliest branching bacteria capable of photosynthesis, and Cfl. aurantiacus has been long regarded as a key organism to resolve the obscurity of the origin and early evolution of photosynthesis. Cfl. aurantiacus contains a chimeric photosystem that comprises some characters of green sulfur bacteria and purple photosynthetic bacteria, and also has some unique electron transport proteins compared to other photosynthetic bacteria.
The complete genomic sequence of Cfl. aurantiacus has been determined, analyzed and compared to the genomes of other photosynthetic bacteria.
Abundant genomic evidence suggests that there have been numerous gene adaptations/replacements in Cfl. aurantiacus to facilitate life under both anaerobic and aerobic conditions, including duplicate genes and gene clusters for the alternative complex III (ACIII), auracyanin and NADH:quinone oxidoreductase; and several aerobic/anaerobic enzyme pairs in central carbon metabolism and tetrapyrroles and nucleic acids biosynthesis. Overall, genomic information is consistent with a high tolerance for oxygen that has been reported in the growth of Cfl. aurantiacus. Genes for the chimeric photosystem, photosynthetic electron transport chain, the 3-hydroxypropionate autotrophic carbon fixation cycle, CO2-anaplerotic pathways, glyoxylate cycle, and sulfur reduction pathway are present. The central carbon metabolism and sulfur assimilation pathways in Cfl. aurantiacus are discussed. Some features of the Cfl. aurantiacus genome are compared with those of the Roseiflexus castenholzii genome. Roseiflexus castenholzii is a recently characterized FAP bacterium and phylogenetically closely related to Cfl. aurantiacus. According to previous reports and the genomic information, perspectives of Cfl. aurantiacus in the evolution of photosynthesis are also discussed.
The genomic analyses presented in this report, along with previous physiological, ecological and biochemical studies, indicate that the anoxygenic phototroph Cfl. aurantiacus has many interesting and certain unique features in its metabolic pathways. The complete genome may also shed light on possible evolutionary connections of photosynthesis.
Modern methods to develop microbe-based biomass conversion processes require a system-level understanding of the microbes involved. Clostridium species have long been recognized as ideal candidates for processes involving biomass conversion and production of various biofuels and other industrial products. To expand the knowledge base for clostridial species relevant to current biofuel production efforts, we have sequenced the genomes of 20 species spanning multiple genera. The majority of species sequenced fall within the class III cellulosome-encoding Clostridium and the class V saccharolytic Thermoanaerobacteraceae. Species were chosen based on representation in the experimental literature as model organisms, ability to degrade cellulosic biomass either by free enzymes or by cellulosomes, ability to rapidly ferment hexose and pentose sugars to ethanol, and ability to ferment synthesis gas to ethanol. The sequenced strains significantly increase the number of noncommensal/nonpathogenic clostridial species and provide a key foundation for future studies of biomass conversion, cellulosome composition, and clostridial systems biology.
Caldicellulosiruptor obsidiansis OB47T (ATCC BAA-2073, JCM 16842) is an extremely thermophilic, anaerobic bacterium capable of hydrolyzing plant-derived polymers through the expression of multidomain/multifunctional hydrolases. The complete genome sequence reveals a diverse set of carbohydrate-active enzymes and provides further insight into lignocellulosic biomass hydrolysis at high temperatures.
The quality of automated gene prediction in microbial organisms has improved steadily over the past decade, but there is still room for improvement. Increasing the number of correct identifications, both of genes and of the translation initiation sites for each gene, and reducing the overall number of false positives, are all desirable goals.
With our years of experience in manually curating genomes for the Joint Genome Institute, we developed a new gene prediction algorithm called Prodigal (PROkaryotic DYnamic programming Gene-finding ALgorithm). With Prodigal, we focused specifically on the three goals of improved gene structure prediction, improved translation initiation site recognition, and reduced false positives. We compared the results of Prodigal to existing gene-finding methods to demonstrate that it met each of these objectives.
We built a fast, lightweight, open source gene prediction program called Prodigal http://compbio.ornl.gov/prodigal/. Prodigal achieved good results compared to existing methods, and we believe it will be a valuable asset to automated microbial annotation pipelines.
“Anaerocellum thermophilum” DSM 6725 is a strictly anaerobic bacterium that grows optimally at 75°C. It uses a variety of polysaccharides, including crystalline cellulose and untreated plant biomass, and has potential utility in biomass conversion. Here we report its complete genome sequence of 2.97 Mb, which is contained within one chromosome and two plasmids (of 8.3 and 3.6 kb). The genome encodes a broad set of cellulolytic enzymes, transporters, and pathways for sugar utilization and compared to those of other saccharolytic, anaerobic thermophiles is most similar to that of Caldicellulosiruptor saccharolyticus DSM 8903.
The complete genome of the ammonia-oxidizing bacterium Nitrosospira multiformis (ATCC 25196T) consists of a circular chromosome and three small plasmids totaling 3,234,309 bp and encoding 2,827 putative proteins. Of the 2,827 putative proteins, 2,026 proteins have predicted functions and 801 are without conserved functional domains, yet 747 of these have similarity to other predicted proteins in databases. Gene homologs from Nitrosomonas europaea and Nitrosomonas eutropha were the best match for 42% of the predicted genes in N. multiformis. The N. multiformis genome contains three nearly identical copies of amo and hao gene clusters as large repeats. The features of N. multiformis that distinguish it from N. europaea include the presence of gene clusters encoding urease and hydrogenase, a ribulose-bisphosphate carboxylase/oxygenase-encoding operon of distinctive structure and phylogeny, and a relatively small complement of genes related to Fe acquisition. Systems for synthesis of a pyoverdine-like siderophore and for acyl-homoserine lactone were unique to N. multiformis among the sequenced genomes of ammonia-oxidizing bacteria. Gene clusters encoding proteins associated with outer membrane and cell envelope functions, including transporters, porins, exopolysaccharide synthesis, capsule formation, and protein sorting/export, were abundant. Numerous sensory transduction and response regulator gene systems directed toward sensing of the extracellular environment are described. Gene clusters for glycogen, polyphosphate, and cyanophycin storage and utilization were identified, providing mechanisms for meeting energy requirements under substrate-limited conditions. The genome of N. multiformis encodes the core pathways for chemolithoautotrophy along with adaptations for surface growth and survival in soil environments.
Sulfur-oxidizing epsilonproteobacteria are common in a variety of sulfidogenic environments. These autotrophic and mixotrophic sulfur-oxidizing bacteria are believed to contribute substantially to the oxidative portion of the global sulfur cycle. In order to better understand the ecology and roles of sulfur-oxidizing epsilonproteobacteria, in particular those of the widespread genus Sulfurimonas, in biogeochemical cycles, the genome of Sulfurimonas denitrificans DSM1251 was sequenced. This genome has many features, including a larger size (2.2 Mbp), that suggest a greater degree of metabolic versatility or responsiveness to the environment than seen for most of the other sequenced epsilonproteobacteria. A branched electron transport chain is apparent, with genes encoding complexes for the oxidation of hydrogen, reduced sulfur compounds, and formate and the reduction of nitrate and oxygen. Genes are present for a complete, autotrophic reductive citric acid cycle. Many genes are present that could facilitate growth in the spatially and temporally heterogeneous sediment habitat from where Sulfurimonas denitrificans was originally isolated. Many resistance-nodulation-development family transporter genes (10 total) are present; of these, several are predicted to encode heavy metal efflux transporters. An elaborate arsenal of sensory and regulatory protein-encoding genes is in place, as are genes necessary to prevent and respond to oxidative stress.