Search tips
Search criteria

Results 1-13 (13)

Clipboard (0)

Select a Filter Below

Year of Publication
Document Types
1.  CDD: a Conserved Domain Database for the functional annotation of proteins 
Nucleic Acids Research  2010;39(Database issue):D225-D229.
NCBI’s Conserved Domain Database (CDD) is a resource for the annotation of protein sequences with the location of conserved domain footprints, and functional sites inferred from these footprints. CDD includes manually curated domain models that make use of protein 3D structure to refine domain models and provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. As CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via
PMCID: PMC3013737  PMID: 21109532
2.  Evolution of AANAT: expansion of the gene family in the cephalochordate amphioxus 
The arylalkylamine N-acetyltransferase (AANAT) family is divided into structurally distinct vertebrate and non-vertebrate groups. Expression of vertebrate AANATs is limited primarily to the pineal gland and retina, where it plays a role in controlling the circadian rhythm in melatonin synthesis. Based on the role melatonin plays in biological timing, AANAT has been given the moniker "the Timezyme". Non-vertebrate AANATs, which occur in fungi and protists, are thought to play a role in detoxification and are not known to be associated with a specific tissue.
We have found that the amphioxus genome contains seven AANATs, all having non-vertebrate type features. This and the absence of AANATs from the genomes of Hemichordates and Urochordates support the view that a major transition in the evolution of the AANATs may have occurred at the onset of vertebrate evolution. Analysis of the expression pattern of the two most structurally divergent AANATs in Branchiostoma lanceolatum (bl) revealed that they are expressed early in development and also in the adult at low levels throughout the body, possibly associated with the neural tube. Expression is clearly not exclusively associated with the proposed analogs of the pineal gland and retina. blAANAT activity is influenced by environmental lighting, but light/dark differences do not persist under constant light or constant dark conditions, indicating they are not circadian in nature. bfAANATα and bfAANATδ' have unusually alkaline (> 9.0) optimal pH, more than two pH units higher than that of vertebrate AANATs.
The substrate selectivity profiles of bfAANATα and δ' are relatively broad, including alkylamines, arylalkylamines and diamines, in contrast to vertebrate forms, which selectively acetylate serotonin and other arylalkylamines. Based on these features, it appears that amphioxus AANATs could play several roles, including detoxification and biogenic amine inactivation. The presence of seven AANATs in amphioxus genome supports the view that arylalkylamine and polyamine acetylation is important to the biology of this organism and that these genes evolved in response to specific pressures related to requirements for amine acetylation.
PMCID: PMC2897805  PMID: 20500864
3.  Non-homologous isofunctional enzymes: A systematic analysis of alternative solutions in enzyme evolution 
Biology Direct  2010;5:31.
Evolutionarily unrelated proteins that catalyze the same biochemical reactions are often referred to as analogous - as opposed to homologous - enzymes. The existence of numerous alternative, non-homologous enzyme isoforms presents an interesting evolutionary problem; it also complicates genome-based reconstruction of the metabolic pathways in a variety of organisms. In 1998, a systematic search for analogous enzymes resulted in the identification of 105 Enzyme Commission (EC) numbers that included two or more proteins without detectable sequence similarity to each other, including 34 EC nodes where proteins were known (or predicted) to have distinct structural folds, indicating independent evolutionary origins. In the past 12 years, many putative non-homologous isofunctional enzymes were identified in newly sequenced genomes. In addition, efforts in structural genomics resulted in a vastly improved structural coverage of proteomes, providing for definitive assessment of (non)homologous relationships between proteins.
We report the results of a comprehensive search for non-homologous isofunctional enzymes (NISE) that yielded 185 EC nodes with two or more experimentally characterized - or predicted - structurally unrelated proteins. Of these NISE sets, only 74 were from the original 1998 list. Structural assignments of the NISE show over-representation of proteins with the TIM barrel fold and the nucleotide-binding Rossmann fold. From the functional perspective, the set of NISE is enriched in hydrolases, particularly carbohydrate hydrolases, and in enzymes involved in defense against oxidative stress.
These results indicate that at least some of the non-homologous isofunctional enzymes were recruited relatively recently from enzyme families that are active against related substrates and are sufficiently flexible to accommodate changes in substrate specificity.
This article was reviewed by Andrei Osterman, Keith F. Tipton (nominated by Martijn Huynen) and Igor B. Zhulin. For the full reviews, go to the Reviewers' comments section.
PMCID: PMC2876114  PMID: 20433725
4.  Bioinformatics and functional analysis define four distinct groups of AlkB DNA-dioxygenases in bacteria 
Nucleic Acids Research  2009;37(21):7124-7136.
The iron(II)- and 2-oxoglutarate (2OG)-dependent dioxygenase AlkB from Escherichia coli (EcAlkB) repairs alkylation damage in DNA by direct reversal. EcAlkB substrates include methylated bases, such as 1-methyladenine (m1A) and 3-methylcytosine (m3C), as well as certain bulkier lesions, for example the exocyclic adduct 1,N6-ethenoadenine (εA). EcAlkB is the only bacterial AlkB protein characterized to date, and we here present an extensive bioinformatics and functional analysis of bacterial AlkB proteins. Based on sequence phylogeny, we show that these proteins can be subdivided into four groups: denoted 1A, 1B, 2A and 2B; each characterized by the presence of specific conserved amino acid residues in the putative nucleotide-recognizing domain. A scattered distribution of AlkB proteins from the four different groups across the bacterial kingdom indicates a substantial degree of horizontal transfer of AlkB genes. DNA repair activity was associated with all tested recombinant AlkB proteins. Notably, both a group 2B protein from Xanthomonas campestris and a group 2A protein from Rhizobium etli repaired etheno adducts, but had negligible activity on methylated bases. Our data indicate that the majority, if not all, of the bacterial AlkB proteins are DNA repair enzymes, and that some of these proteins do not primarily target methylated bases.
PMCID: PMC2790896  PMID: 19786499
5.  Encapsulated in silica: genome, proteome and physiology of the thermophilic bacterium Anoxybacillus flavithermus WK1 
Genome Biology  2008;9(11):R161.
Sequencing of the complete genome of Anoxybacillus flavithermus reveals enzymes that are required for silica adaptation and biofilm formation.
Gram-positive bacteria of the genus Anoxybacillus have been found in diverse thermophilic habitats, such as geothermal hot springs and manure, and in processed foods such as gelatin and milk powder. Anoxybacillus flavithermus is a facultatively anaerobic bacterium found in super-saturated silica solutions and in opaline silica sinter. The ability of A. flavithermus to grow in super-saturated silica solutions makes it an ideal subject to study the processes of sinter formation, which might be similar to the biomineralization processes that occurred at the dawn of life.
We report here the complete genome sequence of A. flavithermus strain WK1, isolated from the waste water drain at the Wairakei geothermal power station in New Zealand. It consists of a single chromosome of 2,846,746 base pairs and is predicted to encode 2,863 proteins. In silico genome analysis identified several enzymes that could be involved in silica adaptation and biofilm formation, and their predicted functions were experimentally validated in vitro. Proteomic analysis confirmed the regulation of biofilm-related proteins and crucial enzymes for the synthesis of long-chain polyamines as constituents of silica nanospheres.
Microbial fossils preserved in silica and silica sinters are excellent objects for studying ancient life, a new paleobiological frontier. An integrated analysis of the A. flavithermus genome and proteome provides the first glimpse of metabolic adaptation during silicification and sinter formation. Comparative genome analysis suggests an extensive gene loss in the Anoxybacillus/Geobacillus branch after its divergence from other bacilli.
PMCID: PMC2614493  PMID: 19014707
6.  Viral AlkB proteins repair RNA damage by oxidative demethylation 
Nucleic Acids Research  2008;36(17):5451-5461.
Bacterial and mammalian AlkB proteins are iron(II)- and 2-oxoglutarate-dependent dioxygenases that reverse methylation damage, such as 1-methyladenine and 3-methylcytosine, in RNA and DNA. An AlkB-domain is encoded by the genome of numerous single-stranded, plant-infecting RNA viruses, the majority of which belong to the Flexiviridae family. Our phylogenetic analysis of AlkB sequences suggests that a single plant virus might have acquired AlkB relatively recently, followed by horizontal dissemination among other viruses via recombination. Here, we describe the first functional characterization of AlkB proteins from three plant viruses. The viral AlkB proteins efficiently reactivated methylated bacteriophage genomes when expressed in Escherichia coli, and also displayed robust, iron(II)- and 2-oxoglutarate-dependent demethylase activity in vitro. Viral AlkB proteins preferred RNA over DNA substrates, and thus represent the first AlkBs with such substrate specificity. Our results suggest a role for viral AlkBs in maintaining the integrity of the viral RNA genome through repair of deleterious methylation damage, and support the notion that AlkB-mediated RNA repair is biologically relevant.
PMCID: PMC2553587  PMID: 18718927
7.  Deinococcus geothermalis: The Pool of Extreme Radiation Resistance Genes Shrinks 
PLoS ONE  2007;2(9):e955.
Bacteria of the genus Deinococcus are extremely resistant to ionizing radiation (IR), ultraviolet light (UV) and desiccation. The mesophile Deinococcus radiodurans was the first member of this group whose genome was completely sequenced. Analysis of the genome sequence of D. radiodurans, however, failed to identify unique DNA repair systems. To further delineate the genes underlying the resistance phenotypes, we report the whole-genome sequence of a second Deinococcus species, the thermophile Deinococcus geothermalis, which at its optimal growth temperature is as resistant to IR, UV and desiccation as D. radiodurans, and a comparative analysis of the two Deinococcus genomes. Many D. radiodurans genes previously implicated in resistance, but for which no sensitive phenotype was observed upon disruption, are absent in D. geothermalis. In contrast, most D. radiodurans genes whose mutants displayed a radiation-sensitive phenotype in D. radiodurans are conserved in D. geothermalis. Supporting the existence of a Deinococcus radiation response regulon, a common palindromic DNA motif was identified in a conserved set of genes associated with resistance, and a dedicated transcriptional regulator was predicted. We present the case that these two species evolved essentially the same diverse set of gene families, and that the extreme stress-resistance phenotypes of the Deinococcus lineage emerged progressively by amassing cell-cleaning systems from different sources, but not by acquisition of novel DNA repair systems. Our reconstruction of the genomic evolution of the Deinococcus-Thermus phylum indicates that the corresponding set of enzymes proliferated mainly in the common ancestor of Deinococcus. Results of the comparative analysis weaken the arguments for a role of higher-order chromosome alignment structures in resistance; more clearly define and substantially revise downward the number of uncharacterized genes that might participate in DNA repair and contribute to resistance; and strengthen the case for a role in survival of systems involved in manganese and iron homeostasis.
PMCID: PMC1978522  PMID: 17895995
8.  Transcriptome Analysis Applied to Survival of Shewanella oneidensis MR-1 Exposed to Ionizing Radiation 
Journal of Bacteriology  2006;188(3):1199-1204.
The ionizing radiation (IR) dose that yields 20% survival (D20) of Shewanella oneidensis MR-1 is lower by factors of 20 and 200 than those for Escherichia coli and Deinococcus radiodurans, respectively. Transcriptome analysis was used to identify the genes of MR-1 responding to 40 Gy (D20). We observed the induction of 170 genes and repression of 87 genes in MR-1 during a 1-h recovery period after irradiation. The genomic response of MR-1 to IR is very similar to its response to UV radiation (254 nm), which included induction of systems involved in DNA repair and prophage synthesis and the absence of differential regulation of tricarboxylic acid cycle activity, which occurs in IR-irradiated D. radiodurans. Furthermore, strong induction of genes encoding antioxidant enzymes in MR-1 was observed. DNA damage may not be the principal cause of high sensitivity to IR, considering that MR-1 carries genes encoding a complex set of DNA repair systems and 40 Gy IR induces less than one double-strand break in its genome. Instead, a combination of oxidative stress, protein damage, and prophage-mediated cell lysis during irradiation and recovery might underlie this organism's great sensitivity to IR.
PMCID: PMC1347324  PMID: 16428429
9.  Comparative genomics of Thermus thermophilus and Deinococcus radiodurans: divergent routes of adaptation to thermophily and radiation resistance 
Thermus thermophilus and Deinococcus radiodurans belong to a distinct bacterial clade but have remarkably different phenotypes. T. thermophilus is a thermophile, which is relatively sensitive to ionizing radiation and desiccation, whereas D. radiodurans is a mesophile, which is highly radiation- and desiccation-resistant. Here we present an in-depth comparison of the genomes of these two related but differently adapted bacteria.
By reconstructing the evolution of Thermus and Deinococcus after the divergence from their common ancestor, we demonstrate a high level of post-divergence gene flux in both lineages. Various aspects of the adaptation to high temperature in Thermus can be attributed to horizontal gene transfer from archaea and thermophilic bacteria; many of the horizontally transferred genes are located on the single megaplasmid of Thermus. In addition, the Thermus lineage has lost a set of genes that are still present in Deinococcus and many other mesophilic bacteria but are not common among thermophiles. By contrast, Deinococcus seems to have acquired numerous genes related to stress response systems from various bacteria. A comparison of the distribution of orthologous genes among the four partitions of the Deinococcus genome and the two partitions of the Thermus genome reveals homology between the Thermus megaplasmid (pTT27) and Deinococcus megaplasmid (DR177).
After the radiation from their common ancestor, the Thermus and Deinococcus lineages have taken divergent paths toward their distinct lifestyles. In addition to extensive gene loss, Thermus seems to have acquired numerous genes from thermophiles, which likely was the decisive contribution to its thermophilic adaptation. By contrast, Deinococcus lost few genes but seems to have acquired many bacterial genes that apparently enhanced its ability to survive different kinds of environmental stresses. Notwithstanding the accumulation of horizontally transferred genes, we also show that the single megaplasmid of Thermus and the DR177 megaplasmid of Deinococcus are homologous and probably were inherited from the common ancestor of these bacteria.
PMCID: PMC1274311  PMID: 16242020
10.  Genome-Wide Molecular Clock and Horizontal Gene Transfer in Bacterial Evolution 
Journal of Bacteriology  2004;186(19):6575-6585.
We describe a simple theoretical framework for identifying orthologous sets of genes that deviate from a clock-like model of evolution. The approach used is based on comparing the evolutionary distances within a set of orthologs to a standard intergenomic distance, which was defined as the median of the distribution of the distances between all one-to-one orthologs. Under the clock-like model, the points on a plot of intergenic distances versus intergenomic distances are expected to fit a straight line. A statistical technique to identify significant deviations from the clock-like behavior is described. For several hundred analyzed orthologous sets representing three well-defined bacterial lineages, the α-Proteobacteria, the γ-Proteobacteria, and the Bacillus-Clostridium group, the clock-like null hypothesis could not be rejected for ∼70% of the sets, whereas the rest showed substantial anomalies. Subsequent detailed phylogenetic analysis of the genes with the strongest deviations indicated that over one-half of these genes probably underwent a distinct form of horizontal gene transfer, xenologous gene displacement, in which a gene is displaced by an ortholog from a different lineage. The remaining deviations from the clock-like model could be explained by lineage-specific acceleration of evolution. The results indicate that although xenologous gene displacement is a major force in bacterial evolution, a significant majority of orthologous gene sets in three major bacterial lineages evolved in accordance with the clock-like model. The approach described here allows rapid detection of deviations from this mode of evolution on the genome scale.
PMCID: PMC516599  PMID: 15375139
11.  Evolution of mosaic operons by horizontal gene transfer and gene displacement in situ 
Genome Biology  2003;4(9):R55.
Comparative genomics and phylogenetic analysis have been used to examine horizontal transfer of entire operons versus displacement of individual genes within operons by horizontally acquired orthologs and independent assembly of the same or similar operons from genes with different phylogenetic affinities.
Shuffling and disruption of operons and horizontal gene transfer are major contributions to the new, dynamic view of prokaryotic evolution. Under the 'selfish operon' hypothesis, operons are viewed as mobile genetic entities that are constantly disseminated via horizontal gene transfer, although their retention could be favored by the advantage of coregulation of functionally linked genes. Here we apply comparative genomics and phylogenetic analysis to examine horizontal transfer of entire operons versus displacement of individual genes within operons by horizontally acquired orthologs and independent assembly of the same or similar operons from genes with different phylogenetic affinities.
Since a substantial number of operons have been identified experimentally in only a few model bacteria, evolutionarily conserved gene strings were analyzed as surrogates of operons. The phylogenetic affinities within these predicted operons were assessed first by sequence similarity analysis and then by phylogenetic analysis, including statistical tests of tree topology. Numerous cases of apparent horizontal transfer of entire operons were detected. However, it was shown that apparent horizontal transfer of individual genes or arrays of genes within operons is not uncommon either and results in xenologous gene displacement in situ, that is, displacement of an ancestral gene by a horizontally transferred ortholog from a taxonomically distant organism without change of the local gene organization. On rarer occasions, operons might have evolved via independent assembly, in part from horizontally acquired genes.
The discovery of in situ gene displacement shows that combination of rampant horizontal gene transfer with selection for preservation of operon structure provides for events in prokaryotic evolution that, a priori, seem improbable. These findings also emphasize that not all aspects of operon evolution are selfish, with operon integrity maintained by purifying selection at the organism level.
PMCID: PMC193655  PMID: 12952534
12.  Genome Sequence and Comparative Analysis of the Solvent-Producing Bacterium Clostridium acetobutylicum 
Journal of Bacteriology  2001;183(16):4823-4838.
The genome sequence of the solvent-producing bacterium Clostridium acetobutylicum ATCC 824 has been determined by the shotgun approach. The genome consists of a 3.94-Mb chromosome and a 192-kb megaplasmid that contains the majority of genes responsible for solvent production. Comparison of C. acetobutylicum to Bacillus subtilis reveals significant local conservation of gene order, which has not been seen in comparisons of other genomes with similar, or, in some cases closer, phylogenetic proximity. This conservation allows the prediction of many previously undetected operons in both bacteria. However, the C. acetobutylicum genome also contains a significant number of predicted operons that are shared with distantly related bacteria and archaea but not with B. subtilis. Phylogenetic analysis is compatible with the dissemination of such operons by horizontal transfer. The enzymes of the solventogenesis pathway and of the cellulosome of C. acetobutylicum comprise a new set of metabolic capacities not previously represented in the collection of complete genomes. These enzymes show a complex pattern of evolutionary affinities, emphasizing the role of lateral gene exchange in the evolution of the unique metabolic profile of the bacterium. Many of the sporulation genes identified in B. subtilis are missing in C. acetobutylicum, which suggests major differences in the sporulation process. Thus, comparative analysis reveals both significant conservation of the genome organization and pronounced differences in many systems that reflect unique adaptive strategies of the two gram-positive bacteria.
PMCID: PMC99537  PMID: 11466286
13.  Complete genome sequence of the extremely acidophilic methanotroph isolate V4, Methylacidiphilum infernorum, a representative of the bacterial phylum Verrucomicrobia 
Biology Direct  2008;3:26.
The phylum Verrucomicrobia is a widespread but poorly characterized bacterial clade. Although cultivation-independent approaches detect representatives of this phylum in a wide range of environments, including soils, seawater, hot springs and human gastrointestinal tract, only few have been isolated in pure culture. We have recently reported cultivation and initial characterization of an extremely acidophilic methanotrophic member of the Verrucomicrobia, strain V4, isolated from the Hell's Gate geothermal area in New Zealand. Similar organisms were independently isolated from geothermal systems in Italy and Russia.
We report the complete genome sequence of strain V4, the first one from a representative of the Verrucomicrobia. Isolate V4, initially named "Methylokorus infernorum" (and recently renamed Methylacidiphilum infernorum) is an autotrophic bacterium with a streamlined genome of ~2.3 Mbp that encodes simple signal transduction pathways and has a limited potential for regulation of gene expression. Central metabolism of M. infernorum was reconstructed almost completely and revealed highly interconnected pathways of autotrophic central metabolism and modifications of C1-utilization pathways compared to other known methylotrophs. The M. infernorum genome does not encode tubulin, which was previously discovered in bacteria of the genus Prosthecobacter, or close homologs of any other signature eukaryotic proteins. Phylogenetic analysis of ribosomal proteins and RNA polymerase subunits unequivocally supports grouping Planctomycetes, Verrucomicrobia and Chlamydiae into a single clade, the PVC superphylum, despite dramatically different gene content in members of these three groups. Comparative-genomic analysis suggests that evolution of the M. infernorum lineage involved extensive horizontal gene exchange with a variety of bacteria. The genome of M. infernorum shows apparent adaptations for existence under extremely acidic conditions including a major upward shift in the isoelectric points of proteins.
The results of genome analysis of M. infernorum support the monophyly of the PVC superphylum. M. infernorum possesses a streamlined genome but seems to have acquired numerous genes including those for enzymes of methylotrophic pathways via horizontal gene transfer, in particular, from Proteobacteria.
This article was reviewed by John A. Fuerst, Ludmila Chistoserdova, and Radhey S. Gupta.
PMCID: PMC2474590  PMID: 18593465

Results 1-13 (13)