PMCC PMCC

Search tips
Search criteria

Advanced
Results 1-10 (10)
 

Clipboard (0)
None

Select a Filter Below

Journals
Year of Publication
Document Types
1.  What we can learn about Escherichia coli through application of Gene Ontology 
Trends in microbiology  2009;17(7):269-278.
How we classify the genes, products, and complexes that are present or absent in genomes, transcriptomes, proteomes, and other datasets helps us place biological objects into subsystems with common functions, see how molecular functions are used to implement biological processes, and compare the biology of different species and strains. Gene Ontology (GO) is one of the most successful systems for classifying biological function. Although GO is widely used for eukaryotic genomics, it has not yet been widely used for bacterial systems. The potential applications of GO are currently limited by the need to improve the annotation of bacterial genomes with GO and to improve how prokaryotic biology is represented in the ontology. In this review, we will discuss why GO should be adopted by microbiologists, and describe recent efforts to build and maintain high-quality GO annotation for Escherichia coli as a model system.
doi:10.1016/j.tim.2009.04.004
PMCID: PMC3575750  PMID: 19576778
2.  EcoCyc: fusing model organism databases with systems biology 
Nucleic Acids Research  2012;41(D1):D605-D612.
EcoCyc (http://EcoCyc.org) is a model organism database built on the genome sequence of Escherichia coli K-12 MG1655. Expert manual curation of the functions of individual E. coli gene products in EcoCyc has been based on information found in the experimental literature for E. coli K-12-derived strains. Updates to EcoCyc content continue to improve the comprehensive picture of E. coli biology. The utility of EcoCyc is enhanced by new tools available on the EcoCyc web site, and the development of EcoCyc as a teaching tool is increasing the impact of the knowledge collected in EcoCyc.
doi:10.1093/nar/gks1027
PMCID: PMC3531154  PMID: 23143106
3.  The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases 
Nucleic Acids Research  2011;40(D1):D742-D753.
The MetaCyc database (http://metacyc.org/) provides a comprehensive and freely accessible resource for metabolic pathways and enzymes from all domains of life. The pathways in MetaCyc are experimentally determined, small-molecule metabolic pathways and are curated from the primary scientific literature. MetaCyc contains more than 1800 pathways derived from more than 30 000 publications, and is the largest curated collection of metabolic pathways currently available. Most reactions in MetaCyc pathways are linked to one or more well-characterized enzymes, and both pathways and enzymes are annotated with reviews, evidence codes and literature citations. BioCyc (http://biocyc.org/) is a collection of more than 1700 organism-specific Pathway/Genome Databases (PGDBs). Each BioCyc PGDB contains the full genome and predicted metabolic network of one organism. The network, which is predicted by the Pathway Tools software using MetaCyc as a reference database, consists of metabolites, enzymes, reactions and metabolic pathways. BioCyc PGDBs contain additional features, including predicted operons, transport systems and pathway-hole fillers. The BioCyc website and Pathway Tools software offer many tools for querying and analysis of PGDBs, including Omics Viewers and comparative analysis. New developments include a zoomable web interface for diagrams; flux-balance analysis model generation from PGDBs; web services; and a new tool called Web Groups.
doi:10.1093/nar/gkr1014
PMCID: PMC3245006  PMID: 22102576
4.  EcoCyc: a comprehensive database of Escherichia coli biology 
Nucleic Acids Research  2010;39(Database issue):D583-D590.
EcoCyc (http://EcoCyc.org) is a comprehensive model organism database for Escherichia coli K-12 MG1655. From the scientific literature, EcoCyc captures the functions of individual E. coli gene products; their regulation at the transcriptional, post-transcriptional and protein level; and their organization into operons, complexes and pathways. EcoCyc users can search and browse the information in multiple ways. Recent improvements to the EcoCyc Web interface include combined gene/protein pages and a Regulation Summary Diagram displaying a graphical overview of all known regulatory inputs to gene expression and protein activity. The graphical representation of signal transduction pathways has been updated, and the cellular and regulatory overviews were enhanced with new functionality. A specialized undergraduate teaching resource using EcoCyc is being developed.
doi:10.1093/nar/gkq1143
PMCID: PMC3013716  PMID: 21097882
5.  EcoCyc: A comprehensive view of Escherichia coli biology 
Nucleic Acids Research  2008;37(Database issue):D464-D470.
EcoCyc (http://EcoCyc.org) provides a comprehensive encyclopedia of Escherichia coli biology. EcoCyc integrates information about the genome, genes and gene products; the metabolic network; and the regulatory network of E. coli. Recent EcoCyc developments include a new initiative to represent and curate all types of E. coli regulatory processes such as attenuation and regulation by small RNAs. EcoCyc has started to curate Gene Ontology (GO) terms for E. coli and has made a dataset of E. coli GO terms available through the GO Web site. The curation and visualization of electron transfer processes has been significantly improved. Other software and Web site enhancements include the addition of tracks to the EcoCyc genome browser, in particular a type of track designed for the display of ChIP-chip datasets, and the development of a comparative genome browser. A new Genome Omics Viewer enables users to paint omics datasets onto the full E. coli genome for analysis. A new advanced query page guides users in interactively constructing complex database queries against EcoCyc. A Macintosh version of EcoCyc is now available. A series of Webinars is available to instruct users in the use of EcoCyc.
doi:10.1093/nar/gkn751
PMCID: PMC2686493  PMID: 18974181
6.  Multidimensional annotation of the Escherichia coli K-12 genome 
Nucleic Acids Research  2007;35(22):7577-7590.
The annotation of the Escherichia coli K-12 genome in the EcoCyc database is one of the most accurate, complete and multidimensional genome annotations. Of the 4460 E. coli genes, EcoCyc assigns biochemical functions to 76%, and 66% of all genes had their functions determined experimentally. EcoCyc assigns E. coli genes to Gene Ontology and to MultiFun. Seventy-five percent of gene products contain reviews authored by the EcoCyc project that summarize the experimental literature about the gene product. EcoCyc information was derived from 15 000 publications. The database contains extensive descriptions of E. coli cellular networks, describing its metabolic, transport and transcriptional regulatory processes. A comparison to genome annotations for other model organisms shows that the E. coli genome contains the most experimentally determined gene functions in both relative and absolute terms: 2941 (66%) for E. coli, 2319 (37%) for Saccharomyces cerevisiae, 1816 (5%) for Arabidopsis thaliana, 1456 (4%) for Mus musculus and 614 (4%) for Drosophila melanogaster. Database queries to EcoCyc survey the global properties of E. coli cellular networks and illuminate the extent of information gaps for E. coli, such as dead-end metabolites. EcoCyc provides a genome browser with novel properties, and a novel interactive display of transcriptional regulatory networks.
doi:10.1093/nar/gkm740
PMCID: PMC2190727  PMID: 17940092
7.  Escherichia coli K-12: a cooperatively developed annotation snapshot—2005 
Nucleic Acids Research  2006;34(1):1-9.
The goal of this group project has been to coordinate and bring up-to-date information on all genes of Escherichia coli K-12. Annotation of the genome of an organism entails identification of genes, the boundaries of genes in terms of precise start and end sites, and description of the gene products. Known and predicted functions were assigned to each gene product on the basis of experimental evidence or sequence analysis. Since both kinds of evidence are constantly expanding, no annotation is complete at any moment in time. This is a snapshot analysis based on the most recent genome sequences of two E.coli K-12 bacteria. An accurate and up-to-date description of E.coli K-12 genes is of particular importance to the scientific community because experimentally determined properties of its gene products provide fundamental information for annotation of innumerable genes of other organisms. Availability of the complete genome sequence of two K-12 strains allows comparison of their genotypes and mutant status of alleles.
doi:10.1093/nar/gkj405
PMCID: PMC1325200  PMID: 16397293
8.  EcoCyc: a comprehensive database resource for Escherichia coli 
Nucleic Acids Research  2004;33(Database Issue):D334-D337.
The EcoCyc database (http://EcoCyc.org/) is a comprehensive source of information on the biology of the prototypical model organism Escherichia coli K12. The mission for EcoCyc is to contain both computable descriptions of, and detailed comments describing, all genes, proteins, pathways and molecular interactions in E.coli. Through ongoing manual curation, extensive information such as summary comments, regulatory information, literature citations and evidence types has been extracted from 8862 publications and added to Version 8.5 of the EcoCyc database. The EcoCyc database can be accessed through a World Wide Web interface, while the downloadable Pathway Tools software and data files enable computational exploration of the data and provide enhanced querying capabilities that web interfaces cannot support. For example, EcoCyc contains carefully curated information that can be used as training sets for bioinformatics prediction of entities such as promoters, operons, genetic networks, transcription factor binding sites, metabolic pathways, functionally related genes, protein complexes and protein–ligand interactions.
doi:10.1093/nar/gki108
PMCID: PMC540062  PMID: 15608210
9.  SdeK Is Required for Early Fruiting Body Development in Myxococcus xanthus 
Journal of Bacteriology  1998;180(17):4628-4637.
Myxococcus xanthus cells carrying the Ω4408 Tn5lac insertion at the sde locus show defects in fruiting body development and sporulation. Our analysis of sde expression patterns showed that this locus is induced early in the developmental program (0 to 2 h) and that expression increases approximately fivefold after 12 h of development. Further studies showed that expression of sde is induced as growing cells enter stationary phase, suggesting that activation of the sde locus is not limited to the developmental process. Because the peak levels of sde expression in both an sde+ and an sde mutant background were similar, we conclude that the sde locus is not autoregulated. Characterization of the sde locus by DNA sequence analysis indicated that the Ω4408 insertion occurred within the sdeK gene. Primer extension analyses localized the 5′ end of sde transcript to a guanine nucleotide 307 bp upstream of the proposed start for the SdeK coding sequence. The DNA sequence in the −12 and −24 regions upstream of the sde transcriptional start site shows similarity to the ς54 family of promoters. The results of complementation studies suggest that the defects in development and sporulation caused by the Ω4408 insertion are due to an inactivation of sdeK. The predicted amino acid sequence of SdeK was found to have similarity to the sequences of the histidine protein kinases of two-component regulatory systems. Based on our results, we propose that SdeK may be part of a signal transduction pathway required for the activation and propagation of the early developmental program.
PMCID: PMC107477  PMID: 9721305
10.  Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology 
Briefings in Bioinformatics  2009;11(1):40-79.
Pathway Tools is a production-quality software environment for creating a type of model-organism database called a Pathway/Genome Database (PGDB). A PGDB such as EcoCyc integrates the evolving understanding of the genes, proteins, metabolic network and regulatory network of an organism. This article provides an overview of Pathway Tools capabilities. The software performs multiple computational inferences including prediction of metabolic pathways, prediction of metabolic pathway hole fillers and prediction of operons. It enables interactive editing of PGDBs by DB curators. It supports web publishing of PGDBs, and provides a large number of query and visualization tools. The software also supports comparative analyses of PGDBs, and provides several systems biology analyses of PGDBs including reachability analysis of metabolic networks, and interactive tracing of metabolites through a metabolic network. More than 800 PGDBs have been created using Pathway Tools by scientists around the world, many of which are curated DBs for important model organisms. Those PGDBs can be exchanged using a peer-to-peer DB sharing system called the PGDB Registry.
doi:10.1093/bib/bbp043
PMCID: PMC2810111  PMID: 19955237
Genome informatics; Metabolic pathways; Pathway bioinformatics; Model organism databases; Genome databases; Biological networks; Regulatory networks

Results 1-10 (10)