Enzymes make the wonderful diversity of life possible, from thermophiles that exist under incredibly harsh conditions to the complexity of higher organisms, such as humans. However, despite their importance and our continued fascination with these often complex proteins we still have a relatively limited understanding of how they function. Since 1964, when the Enzyme Commission (EC) first published their rules for enzyme nomenclature and their system to classify the overall reaction that an enzyme performs (
1), there have been over 5000 EC numbers assigned, although 836 have been subsequently either transferred to other EC numbers, or deleted (data correct as of June 2011). The first proteins with a fully defined sequence and assigned identifier from the curated portion of UniprotKB (Swiss-Prot) (
2) were deposited in the 1980s, and the first crystal structures relating to an enzyme were deposited in the wwPDB (
3) in the early 1970s. Since then, the growth of information has been persistent (A); however, there are still some significant gaps in our knowledge (B).
Of the 4528 currently active EC numbers, only 2792 have a sequence in Swiss-Prot that has a fully assigned EC number (i.e. a catalytic activity with all four levels of the EC number assigned), and of those only 1761 also have an associated structure deposited in the wwPDB, although not all of these will have a reliable mechanism published in the primary literature. Despite this apparent lack of data, there is a great deal of knowledge available, including structures, gene sequences, mechanisms, metabolic pathways and kinetic data. However, these data tend to be spread between many different databases and throughout the literature. Most web resources relating to enzymes [such as BRENDA (
4), KEGG (
5), SABIO-RK (
6), the IUBMB Enzyme Nomenclature website (
1) and IntEnz (
7)] focus on the overall reaction, accompanied in some cases by a textual or graphical description of the mechanism. MACiE (
8,
9), which stands for Mechanism, Annotation and Classification in Enzymes, is a collaboration between the Thornton group (EMBL-EBI), Mitchell group (University of St Andrews, Scotland) and Bertini group (University of Florence, Italy) and was designed to provide a computational description of mechanism by including detailed stepwise mechanistic information for a wide coverage of both chemical space and the protein structure universe. First published in 2005 (
9), MACiE usefully complements both the mechanistic detail of the Structure–Function Linkage Database (SFLD) (
10) which provides information for a small number of rather ‘promiscuous’ enzyme superfamilies, and the wider coverage with less chemical detail provided by EzCatDB (
11) and the Catalytic Site Atlas (CSA) (
12). Entries in MACiE are linked, where appropriate, to all of these related data resources. MACiE is also proving a useful resource for understanding how enzymes catalyse the vast array of chemistry with such a (relatively) limited repertoire of catalytic entities (
13–16).
This new release of MACiE retains all the original features of previous releases, but includes enriched data content through the extension of data entries (next section), new tools for exploring the diversity of biochemical reactions in MACiE (‘New Methods for Characterizing and Comparing Enzyme Mechanisms’ section) as well as new searches and database statistics (see
Supplementary Data). Each biologically meaningful search allows the user to not only access the individual entries, but also view the data in a comparative overview page. Many of these are now available as separate links and visualization of the database online has also been updated (‘Updates to MACiE Website’ section).