Mycobacterium tuberculosis remains a major cause of death due to the lack of treatment accessibility, HIV coinfection, and drug resistance. Development of new drugs targeting previously unexplored pathways is essential to shorten treatment time and eliminate persistent M. tuberculosis. A promising biochemical pathway which may be targeted to kill both replicating and nonreplicating M. tuberculosis is the biosynthesis of NAD(H), an essential cofactor in multiple reactions crucial for respiration, redox balance, and biosynthesis of major building blocks. NaMN adenylyltransferase (NadD) and NAD synthetase (NadE), the key enzymes of NAD biosynthesis, were selected as promising candidate drug targets for M. tuberculosis. Here we report for the first time kinetic characterization of the recombinant purified NadD enzyme, setting the stage for its structural analysis and inhibitor development. A protein knockdown approach was applied to validate bothNadD and NadE as target enzymes. Induced degradation of either target enzyme showed a strong bactericidal effect which coincided with anticipated changes in relative levels of NaMN and NaAD intermediates (substrates of NadD and NadE, respectively) and ultimate depletion of the NAD(H) pool. A metabolic catastrophe predicted as a likely result of NAD(H) deprivation of cellular metabolism was confirmed by 13C biosynthetic labeling followed by gas chromatography-mass spectrometry (GC-MS) analysis. A sharp suppression of metabolic flux was observed in multiple NAD(P)(H)-dependent pathways, including synthesis of many amino acids (serine, proline, aromatic amino acids) and fatty acids. Overall, these results provide strong validation of the essential NAD biosynthetic enzymes, NadD and NadE, as antimycobacterial drug targets.
To address the problems of M. tuberculosis drug resistance and persistence of tuberculosis, new classes of drug targets need to be explored. The biogenesis of NAD cofactors was selected for target validation because of their indispensable role in driving hundreds of biochemical transformations. We hypothesized that the disruption of NAD production in the cell via genetic suppression of the essential enzymes (NadD and NadE) involved in the last two steps of NAD biogenesis would lead to cell death, even under dormancy conditions. In this study, we confirmed the hypothesis using a protein knockdown approach in the model system of Mycobacterium smegmatis. We showed that induced proteolytic degradation of either target enzyme leads to depletion of the NAD cofactor pool, which suppresses metabolic flux through numerous NAD(P)-dependent pathways of central metabolism of carbon and energy production. Remarkably, bactericidal effect was observed even for nondividing bacteria cultivated under carbon starvation conditions.
The essential coenzyme NAD plays important roles in metabolic reactions and cell regulation in all organisms. As such, NAD synthesis has been investigated as a source for novel antibacterial targets. Cross-species genomics-based reconstructions of NAD metabolism in group A streptococci (GAS), combined with focused experimental testing in Streptococcus pyogenes, led to a better understanding of NAD metabolism in the pathogen. The predicted niacin auxotrophy was experimentally verified, as well as the essential role of the nicotinamidase PncA in the utilization of nicotinamide (Nm). PncA is dispensable in the presence of nicotinate (Na), ruling it out as a viable antibacterial target. The function of the “orphan” NadC enzyme, which is uniquely present in all GAS species despite the absence of other genes of NAD de novo synthesis, was elucidated. Indeed, the quinolinate (Qa) phosphoribosyltransferase activity of NadC from S. pyogenes allows the organism to sustain growth when Qa is present as a sole pyridine precursor. Finally, the redundancy of functional upstream salvage pathways in GAS species narrows the choice of potential drug targets to the two indispensable downstream enzymes of NAD synthesis, nicotinate adenylyltransferase (NadD family) and NAD synthetase (NadE family). Biochemical characterization of NadD confirmed its functional role in S. pyogenes, and its potential as an antibacterial target was supported by inhibition studies with previously identified class I inhibitors of the NadD enzyme family. One of these inhibitors efficiently inhibited S. pyogenes NadD (sp.NadD) in vitro (50% inhibitory concentration [IC50], 15 μM), exhibiting a noncompetitive mechanism with a Ki of 8 μM.
Sugar phosphorylation is an indispensable committed step in a large variety of sugar catabolic pathways, which are major suppliers of carbon and energy in heterotrophic species. Specialized sugar kinases that are indispensable for most of these pathways can be utilized as signature enzymes for the reconstruction of carbohydrate utilization machinery from microbial genomic and metagenomic data. Sugar kinases occur in several structurally distinct families with various partially overlapping as well as yet unknown substrate specificities that often cannot be accurately assigned by homology-based techniques. A subsystems-based metabolic reconstruction combined with the analysis of genome context and followed by experimental testing of predicted gene functions is a powerful approach of functional gene annotation. Here we applied this integrated approach for functional mapping of all sugar kinases constituting an extensive and diverse sugar kinome in the thermophilic bacterium Thermotoga maritima. Substrate preferences of 14 kinases mainly from the FGGY and PfkB families were inferred by bioinformatics analysis and biochemically characterized by screening with a panel of 45 different carbohydrates. Most of the analyzed enzymes displayed narrow substrate preferences corresponding to their predicted physiological roles in their respective catabolic pathways. The observed consistency supports the choice of kinases as signature enzymes for genomics-based identification and reconstruction of sugar utilization pathways. Use of the integrated genomic and experimental approach greatly speeds up the identification of the biochemical function of unknown proteins and improves the quality of reconstructed pathways.
The toxicity of ionizing radiation is associated with massive apoptosis in radiosensitive organs. Here, we investigate whether a drug that activates a signaling mechanism used by tumor cells to suppress apoptosis can protect healthy cells from the harmful effects of radiation. We studied CBLB502, a polypeptide drug derived from Salmonella flagellin that binds to Toll-like receptor 5 (TLR5) and activates nuclear factor–κB signaling. A single injection of CBLB502 before lethal total-body irradiation protected mice from both gastrointestinal and hematopoietic acute radiation syndromes and resulted in improved survival. CBLB502 injected after irradiation also enhanced survival, but at lower radiation doses. It is noteworthy that the drug did not decrease tumor radiosensitivity in mouse models. CBLB502 also showed radioprotective activity in lethally irradiated rhesus monkeys. Thus, TLR5 agonists could potentially improve the therapeutic index of cancer radiotherapy and serve as biological protectants in radiation emergencies.
Limited or regulatory proteolysis plays a critical role in many important biological pathways like blood coagulation, cell proliferation, and apoptosis. A better understanding of mechanisms that control this process is required for discovering new proteolytic events and for developing inhibitors with potential therapeutic value. Two features that determine the susceptibility of peptide bonds to proteolysis are the sequence in the vicinity of the scissile bond and the structural context in which the bond is displayed. In this study we assessed statistical significance and predictive power of individual structural descriptors and combination thereof for the identification of cleavage sites. The analysis was performed on a dataset of >200 proteolytic events documented in CutDB for a variety of mammalian regulatory proteases and their physiological substrates with known 3D structures. The results confirmed the significance and provided a ranking within three main categories of structural features: exposure > flexibility > local interactions. Among secondary structure elements, the largest frequency of proteolytic cleavage was confirmed for loops and lower but significant frequency for helices. Limited proteolysis has lower albeit appreciable frequency of occurrence in certain types of β-strands, which is in contrast with some previous reports. Descriptors deduced directly from the amino acid sequence displayed only marginal predictive capabilities. Homology-based structural models showed a predictive performance comparable to protein substrates with experimentally established structures. Overall, this study provided a foundation for accurate automated prediction of segments of protein structure susceptible to proteolytic processing and, potentially, other post-translational modifications.
proteolysis; proteolytic processing; limited proteolysis; regulatory proteolysis; protease; cleavage site; cleavage site prediction
Proteolytic signaling, or regulated proteolysis, is an essential part of many important pathways such as Notch, Wnt, and Hedgehog. How the structure of the cleaved substrate regions influences the efficacy of proteolytic processing remains underexplored. Here, we analyzed the relative importance in proteolysis of various structural features derived from substrate sequences using a dataset of more than five thousand experimentally verified proteolytic events captured in CutDB. Accessibility to the solvent was recognized as an essential property of a proteolytically processed polypeptide chain. Proteolytic events were found nearly uniformly distributed among three types of secondary structure, although with some enrichment in loops. Cleavages in α-helices were found to be relatively abundant in regions apparently prone to unfolding, while cleavages in β-structures tended to be located at the periphery of β-sheets. Application of the same statistical procedures to proteolytic events divided into separate sets according to the catalytic classes of proteases proved consistency of the results and confirmed that the structural mechanisms of proteolysis are universal. The estimated prediction power of sequence-derived structural features, which turned out to be sufficiently high, presents a rationale for their use in bioinformatic prediction of proteolytic events.
The emergence of multidrug-resistant pathogens necessitates the search for new antibiotics acting on previously unexplored targets. Nicotinate mononucleotide adenylyltransferase of the NadD family, an essential enzyme of NAD biosynthesis in most bacteria, was selected as a target for structure-based inhibitor development. Using iterative in silico and in vitro screens we identified small molecule compounds that efficiently inhibited target enzymes from Escherichia coli (ecNadD) and Bacillus anthracis (baNadD) but had no effect on functionally equivalent human enzymes. On-target antibacterial activity was demonstrated for some of the selected inhibitors. A 3D structure of baNadD was solved in complex with one of these inhibitors (3_02) providing mechanistic insights and guidelines for further improvement. Most importantly, the results of this study help validate NadD as a target for the development of antibacterial agents with potential broad-spectrum activity.
L-rhamnose (L-Rha) is a deoxy-hexose sugar commonly found in nature. L-Rha catabolic pathways were previously characterized in various bacteria including Escherichia coli. Nevertheless, homology searches failed to recognize all the genes for the complete L-Rha utilization pathways in diverse microbial species involved in biomass decomposition. Moreover, the regulatory mechanisms of L-Rha catabolism have remained unclear in most species. A comparative genomics approach was used to reconstruct the L-Rha catabolic pathways and transcriptional regulons in the phyla Actinobacteria, Bacteroidetes, Chloroflexi, Firmicutes, Proteobacteria, and Thermotogae. The reconstructed pathways include multiple novel enzymes and transporters involved in the utilization of L-Rha and L-Rha-containing polymers. Large-scale regulon inference using bioinformatics revealed remarkable variations in transcriptional regulators for L-Rha utilization genes among bacteria. A novel bifunctional enzyme, L-rhamnulose-phosphate aldolase (RhaE) fused to L-lactaldehyde dehydrogenase (RhaW), which is not homologous to previously characterized L-Rha catabolic enzymes, was identified in diverse bacteria including Chloroflexi, Bacilli, and Alphaproteobacteria. By using in vitro biochemical assays we validated both enzymatic activities of the purified recombinant RhaEW proteins from Chloroflexus aurantiacus and Bacillus subtilis. Another novel enzyme of the L-Rha catabolism, L-lactaldehyde reductase (RhaZ), was identified in Gammaproteobacteria and experimentally validated by in vitro enzymatic assays using the recombinant protein from Salmonella typhimurium. C. aurantiacus induced transcription of the predicted L-Rha utilization genes when L-Rha was present in the growth medium and consumed L-Rha from the medium. This study provided comprehensive insights to L-Rha catabolism and its regulation in diverse Bacteria.
L-rhamnose catabolism; metabolic reconstruction; regulon; comparative genomics; Chloroflexus
Bacteroides thetaiotaomicron, a predominant member of the human gut microbiota, is characterized by its ability to utilize a wide variety of polysaccharides using the extensive saccharolytic machinery that is controlled by an expanded repertoire of transcription factors (TFs). The availability of genomic sequences for multiple Bacteroides species opens an opportunity for their comparative analysis to enable characterization of their metabolic and regulatory networks.
A comparative genomics approach was applied for the reconstruction and functional annotation of the carbohydrate utilization regulatory networks in 11 Bacteroides genomes. Bioinformatics analysis of promoter regions revealed putative DNA-binding motifs and regulons for 31 orthologous TFs in the Bacteroides. Among the analyzed TFs there are 4 SusR-like regulators, 16 AraC-like hybrid two-component systems (HTCSs), and 11 regulators from other families. Novel DNA motifs of HTCSs and SusR-like regulators in the Bacteroides have the common structure of direct repeats with a long spacer between two conserved sites.
The inferred regulatory network in B. thetaiotaomicron contains 308 genes encoding polysaccharide and sugar catabolic enzymes, carbohydrate-binding and transport systems, and TFs. The analyzed TFs control pathways for utilization of host and dietary glycans to monosaccharides and their further interconversions to intermediates of the central metabolism. The reconstructed regulatory network allowed us to suggest and refine specific functional assignments for sugar catabolic enzymes and transporters, providing a substantial improvement to the existing metabolic models for B. thetaiotaomicron. The obtained collection of reconstructed TF regulons is available in the RegPrecise database (http://regprecise.lbl.gov).
Regulatory network; Regulon; Transcription factor; BACTEROIDES; Carbohydrate utilization
In this perspective, we revise the historic notion that cancer is a disease of mitochondria. We summarize recent findings on the function and rewiring of central carbon metabolism in melanoma. Metabolic profiling studies using stable isotope tracers show that glycolysis is decoupled from the tricarboxylic acid (TCA) cycle. This decoupling is not ‘dysfunction’ but rather an alternate wiring required by tumor cells to remain metabolically versatile. In large part, this requirement is met by glutamine feeding the TCA cycle as an alternative source of carbon. Glutamine is also used in non-conventional ways, like traveling in reverse through the TCA flux to feed fatty acid biosynthesis. The biosynthetic networks linked with non-essential amino acids alanine, serine, arginine, and proline are also significantly impacted by the use of glutamine as an alternate carbon source.
metabolism; mitochondria; glutamine; systems biology; NMR
Experimental data exists for only a vanishingly small fraction of sequenced microbial genes. This community page discusses the progress made by the COMBREX project to address this important issue using both computational and experimental resources.
Hyperthermophilic bacteria from the Thermotogales lineage can produce hydrogen by fermenting a wide range of carbohydrates. Previous experimental studies identified a large fraction of genes committed to carbohydrate degradation and utilization in the model bacterium Thermotoga maritima. Knowledge of these genes enabled comprehensive reconstruction of biochemical pathways comprising the carbohydrate utilization network. However, transcriptional factors (TFs) and regulatory mechanisms driving this network remained largely unknown. Here, we used an integrated approach based on comparative analysis of genomic and transcriptomic data for the reconstruction of the carbohydrate utilization regulatory networks in 11 Thermotogales genomes. We identified DNA-binding motifs and regulons for 19 orthologous TFs in the Thermotogales. The inferred regulatory network in T. maritima contains 181 genes encoding TFs, sugar catabolic enzymes and ABC-family transporters. In contrast to many previously described bacteria, a transcriptional regulation strategy of Thermotoga does not employ global regulatory factors. The reconstructed regulatory network in T. maritima was validated by gene expression profiling on a panel of mono- and disaccharides and by in vitro DNA-binding assays. The observed upregulation of genes involved in catabolism of pectin, trehalose, cellobiose, arabinose, rhamnose, xylose, glucose, galactose, and ribose showed a strong correlation with the UxaR, TreR, BglR, CelR, AraR, RhaR, XylR, GluR, GalR, and RbsR regulons. Ultimately, this study elucidated the transcriptional regulatory network and mechanisms controlling expression of carbohydrate utilization genes in T. maritima. In addition to improving the functional annotations of associated transporters and catabolic enzymes, this research provides novel insights into the evolution of regulatory networks in Thermotogales.
carbohydrate metabolism; transcriptional regulation; regulon; comparative genomics; Thermotoga
We have recently identified the enzyme NMN deamidase (PncC), which plays a key role in the regeneration of NAD in bacteria by recycling back to the coenzyme the pyridine by-products of its non redox consumption. In several bacterial species, PncC is fused to a COG1058 domain of unknown function, highly conserved and widely distributed in all living organisms. Here, we demonstrate that the PncC-fused domain is endowed with a novel Co+2- and K+-dependent ADP-ribose pyrophosphatase activity, and discuss the functional connection of such an activity with NAD recycling. An in-depth phylogenetic analysis of the COG1058 domain evidenced that in most bacterial species it is fused to PncC, while in α- and some δ-proteobacteria, as well as in archaea and fungi, it occurs as a stand-alone protein. Notably, in mammals and plants it is fused to FAD synthase. We extended the enzymatic characterization to a representative bacterial single-domain protein, which resulted to be a more versatile ADP-ribose pyrophosphatase, active also towards diadenosine 5′-diphosphate and FAD. Multiple sequence alignment analysis, and superposition of the available three-dimensional structure of an archaeal COG1058 member with the structure of the enzyme MoeA of the molybdenum cofactor biosynthesis, allowed identification of residues likely involved in catalysis. Their role has been confirmed by site-directed mutagenesis.
The TCA cycle is the central hub of oxidative metabolism, running in the classic forward direction to provide carbon for biosynthesis and reducing agents for generation of ATP. Our metabolic tracer studies in melanoma cells showed that in hypoxic conditions the TCA cycle is largely disconnected from glycolysis. By studying the TCA branch point metabolites, acetyl CoA and citrate, as well as the metabolic endpoints glutamine and fatty acids, we developed a comprehensive picture of the rewiring of the TCA cycle that occurs in hypoxia. Hypoxic tumor cells maintain proliferation by running the TCA cycle in reverse. The source of carbon for acetyl CoA, citrate, and fatty acids switches from glucose in normoxia to glutamine in hypoxia. This hypoxic flux from glutamine into fatty acids is mediated by reductive carboxylation. This reductive carboxylation is catalyzed by two isocitrate dehydrogenases, IDH1 and IDH2. Their combined action is necessary and sufficient to effect the reverse TCA flux and maintain cellular viability.
Toll-like receptor 5 (TLR5) binding to bacterial flagellin activates NF-κB signaling and triggers an innate immune response to the invading pathogen. To elucidate the structural basis and mechanistic implications of TLR5-flagellin recognition, we determined the crystal structure of zebrafish TLR5, as a VLR-hybrid protein, in complex with the D1/D2 fragment of Salmonella flagellin, FliC, at 2.47 Å resolution. TLR5 interacts primarily with the three helices of the FliC D1 domain using its lateral side. Two TLR5-FliC 1:1 heterodimers assemble into a 2:2 tail-to-tail signaling complex that is stabilized by quaternary contacts of the FliC D1 domain with the convex surface of the opposing TLR5. The proposed signaling mechanism is supported by structure-guided mutagenesis and deletion analysis on CBLB502, a therapeutic protein derived from FliC.
Large and functionally heterogeneous families of transcription factors have complex evolutionary histories. What shapes specificities toward effectors and DNA sites in paralogous regulators is a fundamental question in biology. Bacteria from the deep-branching lineage Thermotogae possess multiple paralogs of the repressor, open reading frame, kinase (ROK) family regulators that are characterized by carbohydrate-sensing domains shared with sugar kinases. We applied an integrated genomic approach to study functions and specificities of regulators from this family. A comparative analysis of 11 Thermotogae genomes revealed novel mechanisms of transcriptional regulation of the sugar utilization networks, DNA-binding motifs and specific functions. Reconstructed regulons for seven groups of ROK regulators were validated by DNA-binding assays using purified recombinant proteins from the model bacterium Thermotoga maritima. All tested regulators demonstrated specific binding to their predicted cognate DNA sites, and this binding was inhibited by specific effectors, mono- or disaccharides from their respective sugar catabolic pathways. By comparing ligand-binding domains of regulators with structurally characterized kinases from the ROK family, we elucidated signature amino acid residues determining sugar-ligand regulator specificity. Observed correlations between signature residues and the sugar-ligand specificities provide the framework for structure functional classification of the entire ROK family.
Proline metabolism is linked to hyperprolinemia, schizophrenia, cutis laxa, and cancer. In the latter case, tumor cells tend to rely on proline biosynthesis rather than salvage. Proline is synthesized from either glutamate or ornithine; both are converted to pyrroline-5-carboxylate (P5C), and then to proline via pyrroline-5-carboxylate reductases (PYCRs). Here, the role of three isozymic versions of PYCR was addressed in human melanoma cells by tracking the fate of 13C-labeled precursors. Based on these studies we conclude that PYCR1 and PYCR2, which are localized in the mitochondria, are primarily involved in conversion of glutamate to proline. PYCRL, localized in the cytosol, is exclusively linked to the conversion of ornithine to proline. This analysis provides the first clarification of the role of PYCRs to proline biosynthesis.
Redox-sensing repressor Rex was previously implicated in the control of anaerobic respiration in response to the cellular NADH/NAD+ levels in Gram-positive bacteria. We utilized the comparative genomics approach to infer candidate Rex-binding DNA motifs and assess the Rex regulon content in 119 genomes from 11 taxonomic groups. Both DNA-binding and NAD-sensing domains are broadly conserved in Rex orthologs identified in the phyla Firmicutes, Thermotogales, Actinobacteria, Chloroflexi, Deinococcus-Thermus, and Proteobacteria. The identified DNA-binding motifs showed significant conservation in these species, with the only exception detected in Clostridia, where the Rex motif deviates in two positions from the generalized consensus, TTGTGAANNNNTTCACAA. Comparative analysis of candidate Rex sites revealed remarkable variations in functional repertoires of candidate Rex-regulated genes in various microorganisms. Most of the reconstructed regulatory interactions are lineage specific, suggesting frequent events of gain and loss of regulator binding sites in the evolution of Rex regulons. We identified more than 50 novel Rex-regulated operons encoding functions that are essential for resumption of the NADH:NAD+ balance. The novel functional role of Rex in the control of the central carbon metabolism and hydrogen production genes was validated by in vitro DNA binding assays using the TM0169 protein in the hydrogen-producing bacterium Thermotoga maritima.
NAD is a ubiquitous and essential metabolic redox cofactor which also functions as a substrate in certain regulatory pathways. The last step of NAD synthesis is the ATP-dependent amidation of deamido-NAD by NAD synthetase (NADS). Members of the NADS family are present in nearly all species across the three kingdoms of Life. In eukaryotic NADS, the core synthetase domain is fused with a nitrilase-like glutaminase domain supplying ammonia for the reaction. This two-domain NADS arrangement enabling the utilization of glutamine as nitrogen donor is also present in various bacterial lineages. However, many other bacterial members of NADS family do not contain a glutaminase domain, and they can utilize only ammonia (but not glutamine) in vitro. A single-domain NADS is also characteristic for nearly all Archaea, and its dependence on ammonia was demonstrated here for the representative enzyme from Methanocaldococcus jannaschi. However, a question about the actual in vivo nitrogen donor for single-domain members of the NADS family remained open: Is it glutamine hydrolyzed by a committed (but yet unknown) glutaminase subunit, as in most ATP-dependent amidotransferases, or free ammonia as in glutamine synthetase? Here we addressed this dilemma by combining evolutionary analysis of the NADS family with experimental characterization of two representative bacterial systems: a two-subunit NADS from Thermus thermophilus and a single-domain NADS from Salmonella typhimurium providing evidence that ammonia (and not glutamine) is the physiological substrate of a typical single-domain NADS. The latter represents the most likely ancestral form of NADS. The ability to utilize glutamine appears to have evolved via recruitment of a glutaminase subunit followed by domain fusion in an early branch of Bacteria. Further evolution of the NADS family included lineage-specific loss of one of the two alternative forms and horizontal gene transfer events. Lastly, we identified NADS structural elements associated with glutamine-utilizing capabilities.
Carbon-13 (13C) analysis is a commonly used method for estimating reaction rates in biochemical networks. The choice of carbon labeling pattern is an important consideration when designing these experiments. We present a novel Monte Carlo algorithm for finding the optimal substrate input label for a particular experimental objective (flux or flux ratio). Unlike previous work, this method does not require assumption of the flux distribution beforehand.
Using a large E. coli isotopomer model, different commercially available substrate labeling patterns were tested computationally for their ability to determine reaction fluxes. The choice of optimal labeled substrate was found to be dependent upon the desired experimental objective. Many commercially available labels are predicted to be outperformed by complex labeling patterns. Based on Monte Carlo Sampling, the dimensionality of experimental data was found to be considerably less than anticipated, suggesting that effectiveness of 13C experiments for determining reaction fluxes across a large-scale metabolic network is less than previously believed.
While 13C analysis is a useful tool in systems biology, high redundancy in measurements limits the information that can be obtained from each experiment. It is however possible to compute potential limitations before an experiment is run and predict whether, and to what degree, the rate of each reaction can be resolved.
Transcriptional regulatory networks are fine-tuned systems that help microorganisms respond to changes in the environment and cell physiological state. We applied the comparative genomics approach implemented in the RegPredict Web server combined with SEED subsystem analysis and available information on known regulatory interactions for regulatory network reconstruction for the human pathogen Staphylococcus aureus and six related species from the family Staphylococcaceae. The resulting reference set of 46 transcription factor regulons contains more than 1,900 binding sites and 2,800 target genes involved in the central metabolism of carbohydrates, amino acids, and fatty acids; respiration; the stress response; metal homeostasis; drug and metal resistance; and virulence. The inferred regulatory network in S. aureus includes ∼320 regulatory interactions between 46 transcription factors and ∼550 candidate target genes comprising 20% of its genome. We predicted ∼170 novel interactions and 24 novel regulons for the control of the central metabolic pathways in S. aureus. The reconstructed regulons are largely variable in the Staphylococcaceae: only 20% of S. aureus regulatory interactions are conserved across all studied genomes. We used a large-scale gene expression data set for S. aureus to assess relationships between the inferred regulons and gene expression patterns. The predicted reference set of regulons is captured within the Staphylococcus collection in the RegPrecise database (http://regprecise.lbl.gov).
Bacterial nicotinate mononucleotide adenylyltransferase encoded by the essential gene nadD plays a central role in the synthesis of the redox cofactor NAD+. The NadD enzyme is conserved in the majority of bacterial species and has been recognized as a novel target for developing new and potentially broad-spectrum antibacterial therapeutics. Here we report the crystal structures of Bacillus anthracis NadD in complex with three NadD inhibitors, including two analogues synthesized in the present study. These structures revealed a common binding site shared by different classes of NadD inhibitors and explored the chemical environment surrounding this site. The structural data obtained here also showed that the subtle changes in ligand structure can lead to significant changes in the binding mode, information that will be useful for future structure-based optimization and design of high affinity inhibitors.
Genome-scale prediction of gene regulation and reconstruction of transcriptional regulatory networks in bacteria is one of the critical tasks of modern genomics. The Shewanella genus is comprised of metabolically versatile gamma-proteobacteria, whose lifestyles and natural environments are substantially different from Escherichia coli and other model bacterial species. The comparative genomics approaches and computational identification of regulatory sites are useful for the in silico reconstruction of transcriptional regulatory networks in bacteria.
To explore conservation and variations in the Shewanella transcriptional networks we analyzed the repertoire of transcription factors and performed genomics-based reconstruction and comparative analysis of regulons in 16 Shewanella genomes. The inferred regulatory network includes 82 transcription factors and their DNA binding sites, 8 riboswitches and 6 translational attenuators. Forty five regulons were newly inferred from the genome context analysis, whereas others were propagated from previously characterized regulons in the Enterobacteria and Pseudomonas spp.. Multiple variations in regulatory strategies between the Shewanella spp. and E. coli include regulon contraction and expansion (as in the case of PdhR, HexR, FadR), numerous cases of recruiting non-orthologous regulators to control equivalent pathways (e.g. PsrA for fatty acid degradation) and, conversely, orthologous regulators to control distinct pathways (e.g. TyrR, ArgR, Crp).
We tentatively defined the first reference collection of ~100 transcriptional regulons in 16 Shewanella genomes. The resulting regulatory network contains ~600 regulated genes per genome that are mostly involved in metabolism of carbohydrates, amino acids, fatty acids, vitamins, metals, and stress responses. Several reconstructed regulons including NagR for N-acetylglucosamine catabolism were experimentally validated in S. oneidensis MR-1. Analysis of correlations in gene expression patterns helps to interpret the reconstructed regulatory network. The inferred regulatory interactions will provide an additional regulatory constrains for an integrated model of metabolism and regulation in S. oneidensis MR-1.
The constitutive activation of the anoxic redox control transcriptional regulator (ArcA) in Escherichia coli during aerobic growth, with the consequent production of a strain that exhibits anaerobic physiology even in the presence of air, is reported in this work. Removal of three terminal cytochrome oxidase genes (cydAB, cyoABCD, and cbdAB) and a quinol monooxygenase gene (ygiN) from the E. coli K-12 MG1655 genome resulted in the activation of ArcA aerobically. These mutations resulted in reduction of the oxygen uptake rate by nearly 98% and production of d-lactate as a sole by-product under oxic and anoxic conditions. The knockout strain exhibited nearly identical physiological behaviors under both conditions, suggesting that the mutations resulted in significant metabolic and regulatory perturbations. In order to fully understand the physiology of this mutant and to identify underlying metabolic and regulatory reasons that prevent the transition from an aerobic to an anaerobic phenotype, we utilized whole-genome transcriptome analysis, 13C tracing experiments, and physiological characterization. Our analysis showed that the deletions resulted in the activation of anaerobic respiration under oxic conditions and a consequential shift in the content of the quinone pool from ubiquinones to menaquinones. An increase in menaquinone concentration resulted in the activation of ArcA. The activation of the ArcB/ArcA regulatory system led to a major shift in the metabolic flux distribution through the central metabolism of the mutant strain. Flux analysis indicated that the mutant strain had undetectable fluxes around the tricarboxylic acid (TCA) cycle and elevated flux through glycolysis and anaplerotic input to oxaloacetate. Flux and transcriptomics data were highly correlated and showed similar patterns.
Carbohydrates are a primary source of carbon and energy for many bacteria. Accurate projection of known carbohydrate catabolic pathways across diverse bacteria with complete genomes constitutes a substantial challenge due to frequent variations in components of these pathways. To address a practically and fundamentally important challenge of reconstruction of carbohydrate utilization machinery in any microorganism directly from its genomic sequence, we combined a subsystems-based comparative genomic approach with experimental validation of selected bioinformatic predictions by a combination of biochemical, genetic and physiological experiments.
We applied this integrated approach to systematically map carbohydrate utilization pathways in 19 genomes from the Shewanella genus. The obtained genomic encyclopedia of sugar utilization includes ~170 protein families (mostly metabolic enzymes, transporters and transcriptional regulators) spanning 17 distinct pathways with a mosaic distribution across Shewanella species providing insights into their ecophysiology and adaptive evolution. Phenotypic assays revealed a remarkable consistency between predicted and observed phenotype, an ability to utilize an individual sugar as a sole source of carbon and energy, over the entire matrix of tested strains and sugars.
Comparison of the reconstructed catabolic pathways with E. coli identified multiple differences that are manifested at various levels, from the presence or absence of certain sugar catabolic pathways, nonorthologous gene replacements and alternative biochemical routes to a different organization of transcription regulatory networks.
The reconstructed sugar catabolome in Shewanella spp includes 62 novel isofunctional families of enzymes, transporters, and regulators. In addition to improving our knowledge of genomics and functional organization of carbohydrate utilization in Shewanella, this study led to a substantial expansion of our current version of the Genomic Encyclopedia of Carbohydrate Utilization. A systematic and iterative application of this approach to multiple taxonomic groups of bacteria will further enhance it, creating a knowledge base adequate for the efficient analysis of any newly sequenced genome as well as of the emerging metagenomic data.