|Home | About | Journals | Submit | Contact Us | Français|
MACiE (Mechanism, Annotation and Classification in Enzymes) is a publicly available web-based database, held in CMLReact (an XML application), that aims to help our understanding of the evolution of enzyme catalytic mechanisms and also to create a classification system which reflects the actual chemical mechanism (catalytic steps) of an enzyme reaction, not only the overall reaction.
Enzymes are proteins that catalyse the repertoire of chemical reactions found in nature and can range in molecular weight from several thousand to several million daltons. They have a wonderful diversity and complexity in the reactions they perform with incredible stereo- and regio-selectivity. These reactions are ubiquitous and essential to the chemistry of life. A great deal of knowledge, including structures, gene sequences, mechanisms, metabolic pathways and kinetic data, now exists. However it is spread between many different databases and throughout the literature. Here we announce the completion of the initial version of MACiE, a unique database of the chemical mechanisms of enzymatic reactions.
Web resources such as BRENDA (Schomburg et al. (2004)), KEGG (Kanehisa et al. (2004)) and the IUBMB Enzyme Nomenclature website (IUBMB (2005)) contain descriptions of the overall reactions performed by enzymes, accompanied in some cases by a textual or graphical description of the mechansim. MACiE is unique in combining detailed stepwise mechanistic information (including 2-D animations), a wide coverage of both chemical space and the protein structure universe, and the chemical intelligence of CMLReact (Holliday et al. (2005)). MACiE usefully complements both the mechanistic detail of the Structure-Function Linkage Database (SFLD) for a small number of enzyme superfamilies (Pegg et al. (2005)) and the wider coverage with less chemical detail provided by EzCatDB (Nagano (2005)) which also contains a limited number of 3-D animations.
All MACiE enzymes are also contained in the Enzyme Commission (EC) classification system (IUBMB (2005)), that is they all have four number codes describing their overall reaction. The first level (Class) describes the basic reaction type, of which there are six defined: oxidoreductases (1), transferases (2), hydrolases (3), lyases (4), isomerases (5) and ligases (6). The second and third levels (subclass and sub-subclass, respectively) describe the reaction in further detail and the final level (serial number) describes substrate specificity. For example the β-lactamases, Fig. 1, are assigned the EC number 188.8.131.52, i.e. a hydrolase (3) acting on a C-N bond (5) in a cyclic amide (2) with a β-lactam as the substrate (6).
In MACiE, data centre on the catalytic steps involved in the chemical mechanism as well as the overall reaction. Each entry includes:
The criteria defined in the Design section initially produced a dataset of 100 entries. A single EC number may cover a plurality of MACiE entries when different mechanisms bring about the same overall chemical transformation, as with the two types of 3-dehydroquinate dehydratase, and thus 100 MACiE entries span only 96 EC numbers.
The 100 enzymes in Version 1 of MACiE incorporate domains from 140 CATH homologous superfamilies. MACiE currently covers 56 of the 174 EC sub-subclasses present in the PDB, thus, we feel that we have a representative coverage of EC reaction space (comparative EC wheels are available at URL http://www-mitchell.ch.cam.ac.uk/macie/ECCoverage/). We anticipate that all 158 sub-subclasses for which both structures and reliable mechanisms are available will be represented in the forthcoming MACiE Version 2.
The data are initially entered in MDL’s ISIS/Base, a database package for chemical reactions, validated by at least two people, and then converted into CMLReact using the Jumbo Toolkit (Wakelin et al. (2005)) to create an information and semantically rich database. At this stage we add extra fields of information to the CMLReact version of MACiE that are unavailable in the ISIS version, including the CATH code. Jumbo is a set of Java based software which converts the MDL file format produced from ISIS/Base into CMLReact. The MacieConverter section of Jumbo performs the following functions:
Once the conversion process has been completed, a further tool in the Jumbo Toolkit, called CMLSnap (Holliday et al. (2004)), can be used to create an animation of the reaction. This animation includes all of the atoms and bonds involved as well as the electron movements, which are calculated automatically. It is expected that CML will become our primary method of data entry and storage.
The annotation process involves input and validation steps. Terms have been rigorously defined either from the IUPAC Gold Book (McNaught et al. (1997)), such as chemical terms like hydrolysis, or from primary literature, such as mechanism, which is defined using Ingold’s terminology (Ingold (1969)), originally put forward in the 1930’s. All of the technical and scientific terms used in MACiE are contained in the MACiE dictionary, which is available at the URL http://www-mitchell.ch.cam.ac.uk/macie/glossary.html and is also available as a raw XML file.
The entries online are accessed via an HTML look-up table and include all of the information available in the database. The original ISIS/Base format file and the raw CML files can be supplied.
Future work includes expanding the dataset to include a representative set of EC numbers (at the sub-subclass level), creating a search interface for MACiE and developing authoring tools for MACiE in CML. Ongoing research focuses on the evolution of enzyme catalysis and the classification of enzyme reaction mechanisms.
We would like to thank the EPSRC (GLH & JBOM), BBSRC (GJB & JMT - CASE studentship in association with Roche Products Ltd; NMOB & JBOM - grant BB/C51320X/1), the Chilean Government’s Ministerio de Planificación y Cooperación and Cambridge Overseas Trust (DEA) for funding and Unilever for supporting the Centre for Molecular Science Informatics. GJB thanks Dr Jonathan Goodman for his invaluable help with organic chemistry queries.