Intracellular glucose in Escherichia coli cells imported by phosphoenolpyruvate-dependent phosphotransferase system-independent uptake is phosphorylated by glucokinase by using ATP to yield glucose-6-phosphate. Glucokinases (EC 2.7.1.2) are functionally distinct from hexokinases (EC 2.7.1.1) with respect to their narrow specificity for glucose as a substrate. While structural information is available for ADP-dependent glucokinases from Archaea, no structural information exists for the large sequence family of eubacterial ATP-dependent glucokinases. Here we report the first structure determination of a microbial ATP-dependent glucokinase, that from E. coli O157:H7. The crystal structure of E. coli glucokinase has been determined to a 2.3-Å resolution (apo form) and refined to final Rwork/Rfree factors of 0.200/0.271 and to 2.2-Å resolution (glucose complex) with final Rwork/Rfree factors of 0.193/0.265. E. coli GlK is a homodimer of 321 amino acid residues. Each monomer folds into two domains, a small α/β domain (residues 2 to 110 and 301 to 321) and a larger α+β domain (residues 111 to 300). The active site is situated in a deep cleft between the two domains. E. coli GlK is structurally similar to Saccharomyces cerevisiae hexokinase and human brain hexokinase I but is distinct from the ADP-dependent GlKs. Bound glucose forms hydrogen bonds with the residues Asn99, Asp100, Glu157, His160, and Glu187, all of which, except His160, are structurally conserved in human hexokinase 1. Glucose binding results in a closure of the small domains, with a maximal Cα shift of ~10 Å. A catalytic mechanism is proposed that is consistent with Asp100 functioning as the general base, abstracting a proton from the O6 hydroxyl of glucose, followed by nucleophilic attack at the γ-phosphoryl group of ATP, yielding glucose-6-phosphate as the product.
Growth of Escherichia coli by using various fermentable sugars as carbon sources, including glucose, maltose, galactose, and sucrose, primarily involves the phosphoenolpyruvate-dependent phosphotransferase system (PTS) (reviewed in reference 54). However, a secondary, PTS-independent system for utilization of glucose also exists, consisting of glucose uptake by galactose permease (GalP; galactose proton symporter), followed by phosphorylation by glucokinase (GlK; EC 2.7.1.2) to yield the metabolic intermediate glucose-6-phosphate. Although glk mutant strains of E. coli (43) and Bacillus subtilis (63) are not visibly physiologically impaired, this enzyme retains the important function of phosphorylating any free intracellular glucose. Free cytoplasmic glucose may arise from disaccharide hydrolysis, for example, the cleavage of trehalose phosphate in Bacillus subtilis (25), or from metabolism of maltose or isomaltose (61). Indeed, studies of a PTS− E. coli strain have shown that a growth rate approximately 89% that of wild-type cells can be obtained by overexpression of GalP alone, suggesting that glucose transport, not GlK-dependent phosphorylation, is limiting growth (28). There is considerable industrial interest in enhancing the ability of E. coli to transport and phosphorylate glucose in a PTS-independent manner due to the ability of these strains to direct more carbon flux to aromatic synthesis pathways (20, 21, 28).
Microbial glucokinases can be divided into three families based on sequence comparisons. Group I (protein families database [PFAM] accession number PF04587) (7) consists of ATP- and ADP-dependent glucokinases (EC 2.7.1.147) from archaea (reviewed in reference 60), members of which have also recently been identified in eukaryotes (56). This group also includes a novel, bifunctional ADP-dependent GlK/PfK enzyme from Methanococcus jannaschii (59). Group I is the only group for which crystal structures have been determined to date. Group II glucokinases (PFAM accession numbers PF02685 and COG0837) are ATP-dependent glucokinases that do not have the classical repressor open reading frame kinase (ROK) sequence motif (69) and consist of over 50 full and partial protein sequences, including E. coli GlK (43). The overwhelming majority of these sequences (49 of 52) are from bacteria, including both cyanobacteria (8 sequences) and proteobacteria (41 sequences). Group III consists of ATP-dependent glucokinases from both archaea (24) and bacteria (23, 63, 64) that possess the ROK sequence signature (PFAM accession number PF00480) and have a conserved CXCGX(2)GCXE motif (conserved Cys residues are highlighted) (42). Mutagenesis of any of these Cys residues to Ala in Bacillus subtilis GlK results in an inactive enzyme, suggesting their functional importance (42). The ATP/polyphosphate glucokinase from Mycobacterium tuberculosis (30) and glucomannokinase from Arthrobacter sp. strain KM (45) as well as the strictly polyphosphate-dependent GlK from Microlunatus phosphovorus (66) are also members of this group.
Enzymes that transfer a phosphoryl group to the 6-hydroxyl group of a hexose include both hexokinases (EC 2.7.1.1), having broad sugar specificity, and glucokinases (EC 2.7.1.2), more specific for glucose (reviewed in reference 72). In many cases, these enzymes have been somewhat arbitrarily classified as one or the other, owing to incomplete experimental data on their sugar specificity. Glucokinases that utilize ATP as a phosphoryl donor may, in addition, use other nucleoside triphosphates (37, 58) or polyphosphate (reviewed in reference 53) as substrates or both ATP and polyphosphate (30, 52). Recently, a strictly polyphosphate-dependent glucokinase (EC 2.7.1.63) has been purified from Microlunatus phosphovorus (66). GlK from E. coli has been cloned, purified, and studied kinetically (43). It is a cytoplasmic enzyme having 321 residues and a monomeric mass of 35 kDa. This enzyme shows much greater activity with glucose than with either mannose or galactose and shows no activity with fructose, thereby defining it as a glucokinase (43).
Several crystal structures of hexokinases have been determined, including hexokinase A(PI) (10) and B(PII) (4, 38), both from Saccharomyces cerevisiae, rat type I and Schistosoma mansoni hexokinase (46), human brain type I (1, 2, 3, 57), and, recently, human type IV (glucokinase) (36). Only three microbial ADP-dependent GlK structures are available, all from sequence group I. These include the enzyme from Thermococcus litoralis bound to ADP (33), Pyrococcus horikoshii GlK (70), and Pyrococcus furiosus GlK bound to glucose and AMP (34). However, no structures of microbial glucokinases from either group II or group III are currently known.
We have determined the first structure of a member of the group II microbial glucokinase family, that from E. coli O157:H7 (ecGlK). This structure reveals a dimeric enzyme that has a similar fold to human and yeast hexokinases, indicative of a common ancestral enzyme, although the sequence identity is low (16 to 18%). Key residues responsible for glucose and nucleotide binding and catalysis are conserved, both in sequence as short motifs and in structure. The structure of ecGlK is distinct from that of the ADP-dependent GlKs from Archaea. Glucose binding results in domain closure, as found in both archaeal GlKs and hexokinases.
The gene for ecGlK was amplified from E. coli O157:H7 genomic DNA (50) obtained from the American Type Culture Collection by using primers from Integrated DNA Technologies (Coralville, Iowa) and Pfu DNA polymerase (Stratagene, La Jolla, Calif.). The amplicon was cloned into a pET15 vector derivative in frame with an N-terminal, noncleavable His6 tag by using a BamHI/EcoRI cloning strategy and was transformed into E. coli BL21(DE3) for expression. For protein production, a 1-liter culture of LeMaster medium (27) containing ampicillin at a concentration of 100 μg/liter was inoculated with a 100-ml overnight culture and grown for 2 h at 37°C. Isopropyl-β-D-thiogalactopyranoside (Sigma) was added at a final concentration of 0.1 mM, and the culture was continued for 6 h. Cells were harvested by centrifugation (4,000 × g at 4°C for 25 min) and stored at −20°C.
For purification, the cell pellet was resuspended in 30 ml of lysis buffer (50 mM Tris-Cl [pH 7.5], 400 mM NaCl, 10 mM β-mercaptoethanol, 5% [wt/vol] glycerol, 1× BugBuster cell lysis detergent [Novagen], 300 U of Benzonase nuclease [Novagen], 1.5 mg of lysozyme [Sigma], and 1 tablet of complete EDTA-free protease inhibitor cocktail [Roche Molecular Biologicals]). This lysate was applied to a 2-ml bed volume of DEAE-Sepharose (Amersham) packed in an Econo column (Bio-Rad) and equilibrated with the same buffer. Following incubation, the mixture was poured into an Econo column, and the flowthrough was collected. This was applied to a 4-ml bed volume of Ni-NTA resin (Qiagen) preequilibrated in the same buffer. Following washing, first in buffer with 1 M NaCl and then in buffer with 0.3 M NaCl, proteins were eluted by using 25 ml of 200 mM imidazole, pH 8. Protein fractions were checked for purity by sodium dodecyl sulfate and native polyacrylamide gel electrophoresis; pure fractions were concentrated, and buffer was exchanged into 20 mM Tris (pH 8), 0.2 M NaCl, 5% glycerol, and 10 mM dithiothreitol by ultrafiltration (Centriprep, Millipore). Approximately 8 mg of pure GlK protein was obtained per liter of culture. Protein concentration was determined by the method of Bradford (14).
Dynamic light scattering (DLS) was performed by using a DynaPro MSPRII molecular sizing instrument (Proterion Corp., Piscataway, N.J.) and analyzed by using Dynamics V6 software. A volume of 20 μl of protein (6.3 mg/ml) in buffer (20 mM Tris-Cl [pH 8], 0.2 M NaCl, 5% glycerol, 10 mM dithiothreitol) was analyzed in a 96-well plate at room temperature.
Crystals of apo-ecGlK were obtained by the hanging drop vapor diffusion method in drops containing 2 μl of SeMet-labeled protein (6.8 mg/ml) and 4 μl of reservoir solution [1.7 M (NH4)2SO4, 0.1 M Tris-Cl (pH 8.5)] suspended over 1 ml of reservoir solution. The crystals belong to the space group P43212 with the cell dimensions a = b = 81.5 and c = 234.7 Å; the crystals contain two molecules in the asymmetric unit.
Crystals of the ecGlK-glc complex were obtained by the hanging drop vapor diffusion method in drops containing 2 μl of SeMet-labeled protein (6.8 mg/ml) and 4 μl of reservoir solution (18.5 to 20% [wt/vol] PEG6000, 0.1 M Tris-Cl buffer [pH 8.5], 0.2 M MgCl2) with the addition of 2 to 3 mM ADP and 2 mM glucose suspended over 1 ml of reservoir solution. The crystals belong to the space group P21 with the following cell dimensions: a = 78.5, b = 53.6, and c = 91.1 Å; β = 113.0°. The crystals contain two molecules in the asymmetric unit.
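As a plausibility check on the stated asymmetric unit contents, the Matthews coefficient (crystal volume per unit of protein mass) can be estimated from the cell dimensions reported for the two crystal forms. The sketch below is not part of the original analysis. It assumes the approximate 35-kDa monomer mass quoted for GlK (ignoring the His6 tag and SeMet substitution) and the standard number of asymmetric units per cell for each space group (eight for P43212, two for P21); the function names are illustrative.

```python
import math

def cell_volume(a, b, c, alpha=90.0, beta=90.0, gamma=90.0):
    """Unit cell volume (A^3) from cell lengths (A) and angles (degrees)."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1 - ca**2 - cb**2 - cg**2 + 2 * ca * cb * cg)

def matthews_coefficient(volume, asu_per_cell, mol_per_asu, mass_da):
    """Crystal volume per dalton of protein (A^3/Da)."""
    return volume / (asu_per_cell * mol_per_asu * mass_da)

MONOMER_MASS_DA = 35_000  # assumption: approximate monomer mass, tag ignored

# Apo form, P4(3)2(1)2: eight asymmetric units per cell, two molecules each
vm_apo = matthews_coefficient(
    cell_volume(81.5, 81.5, 234.7), 8, 2, MONOMER_MASS_DA)

# Glucose complex, P2(1): two asymmetric units per cell, two molecules each
vm_glc = matthews_coefficient(
    cell_volume(78.5, 53.6, 91.1, beta=113.0), 2, 2, MONOMER_MASS_DA)

print(f"apo: {vm_apo:.2f} A^3/Da, glucose complex: {vm_glc:.2f} A^3/Da")
```

Under these assumptions, both crystal forms give values of roughly 2.8 and 2.5 Å3/Da, respectively, which is compatible with two molecules in the asymmetric unit in each case.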
Prior to data collection the crystals were immersed for 10 s in a cryoprotectant solution containing either 2 M (NH4)2SO4, 0.1 M Tris-Cl (pH 8.5), 3 M sodium formate (for apo-ecGlK) or 23% (wt/vol) PEG6000, 0.1 M Tris-Cl buffer (pH 8.5), 0.2 M MgCl2, 20% (wt/vol) glycerol (for ecGlK-glc), mounted in a nylon loop and flash-cooled in a cold stream of N2 gas at 100 K. Data were collected at the beamlines X8C and X25 of the National Synchrotron Light Source (NSLS), Brookhaven National Laboratory, by using a quantum 4 charge-coupled device detector (X8C) or Q-315 detector (X25) and were processed with either HKL2000 (49) or d*TREK (51).
The structure of ecGlK was determined by using a three-wavelength multiwavelength anomalous diffraction experiment at the Se K edge from SeMet-labeled apo-protein (Table 1). All 10 expected Se sites were identified by using the program SOLVE (68). Density modification and model building were performed by using RESOLVE (67), resulting in a model containing 73% (472 of 642) main chain and 67% (3,284 of 4,911) total atoms. Further model building was performed by using O (35), alternating with cycles of refinement by using the program Refmac5 (47). The model has been refined to a final R factor of 0.200 and Rfree of 0.271 at a 2.3-Å resolution with no σ-cutoff. The model contains two molecules in the asymmetric unit and includes residues 2 to 321 in each monomer and 387 water molecules.
The structure of the ecGlK-glc complex was solved by molecular replacement by using the program MOLREP (71) from the CCP4 suite (73) with apo-ecGlK used as the starting model. Refinement was performed by using the program REFMAC5 (47), giving a final R factor of 0.193 and Rfree of 0.265 at a 2.2-Å resolution with no σ-cutoff. The model contains two molecules in the asymmetric unit and includes residues 3 to 321 (monomer A) and 2 to 321 (monomer B), one molecule of glucose bound to each monomer, and 465 water molecules. Data collection and refinement statistics are summarized in Table 1. Both models have good geometry without outliers, as shown by the program PROCHECK (39).
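For readers less familiar with the R and Rfree values quoted above, the conventional crystallographic residual is R = Σ||Fo| − |Fc|| / Σ|Fo|, computed over the working reflections for R and over a held-out test set for Rfree. The following is a minimal sketch of that definition only; it assumes the calculated amplitudes are already on the same scale as the observed ones, and the amplitudes and flags in the example are invented for illustration, not taken from the deposited data.

```python
def r_factor(f_obs, f_calc):
    """Conventional residual: sum(| |Fo| - |Fc| |) / sum(|Fo|)."""
    if len(f_obs) != len(f_calc):
        raise ValueError("amplitude lists must have the same length")
    numerator = sum(abs(abs(fo) - abs(fc)) for fo, fc in zip(f_obs, f_calc))
    denominator = sum(abs(fo) for fo in f_obs)
    return numerator / denominator

def r_work_and_free(f_obs, f_calc, free_flags):
    """Split reflections by free flag and return (R_work, R_free)."""
    work = [(fo, fc) for fo, fc, free in zip(f_obs, f_calc, free_flags) if not free]
    free = [(fo, fc) for fo, fc, free in zip(f_obs, f_calc, free_flags) if free]
    r_work = r_factor(*zip(*work))
    r_free = r_factor(*zip(*free))
    return r_work, r_free

# Hypothetical, pre-scaled amplitudes; the last reflection is in the test set.
f_obs = [100.0, 50.0, 80.0, 20.0]
f_calc = [90.0, 55.0, 80.0, 25.0]
flags = [False, False, False, True]
r_work, r_free = r_work_and_free(f_obs, f_calc, flags)
```

Because the test-set reflections are excluded from refinement, Rfree is normally somewhat higher than R, as is the case for both models reported here.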
Coordinates of ecGlK have been deposited in the Research Collaboratory for Structural Bioinformatics Protein Data Bank (PDB) (11) with accession codes 1Q18 (apo form) and 1SZ2 (glucose complex).
The structure of apo-ecGlK from the P43212 crystal form was determined by a three-wavelength multiwavelength anomalous diffraction experiment from SeMet-labeled protein (26) and refined to an R factor of 0.200 (Rfree = 0.271) at a 2.3-Å resolution. This model contains two molecules in the asymmetric unit and includes residues 2 to 321 in each monomer. Data collection and refinement statistics are presented in Table 1.
Each monomer of ecGlK consists of a small α/β domain made of two noncontiguous segments (residues 2 to 110 and 300 to 321) and a larger α+β domain (residues 111 to 299) (Fig. 1). A total of 13 β-strands and 11 α-helices make up the monomer and are labeled consecutively from the N to C terminus, as shown in Fig. 1. The small domain consists of a single, central, five-stranded mixed β-sheet (β3-β2-β1-β4-β7), with β2 antiparallel to the rest. This β-sheet is flanked on one face by a pair of α-helices (α1 and α2) and a β-hairpin (β5-β6) and on the opposite face by a pair of α-helices (α3 and α11). This five-turn-long α-helix (α11, residues 301 to 321) is not contiguous with the rest of the domain and comes from the C-terminal end of the monomer. It forms part of the interface between the large and small domains.
The large domain contains a mixed, six-stranded β-sheet (β8-β13-β12-β9-β10-β11) with β8 and β10 antiparallel to the rest. One face of this sheet is adjacent to a cluster of seven α-helices (α4 to α10), while the other face is directed toward the small domain. A longer central helix, α7, forms the core of the α-helix cluster. The interface between the large and small domains forms the active site cleft ~28 Å wide and ~20 Å deep. A single helix, α3 (residues 100 to 109), connects the two domains.
Analysis of purified ecGlK by DLS suggested it to be a dimer in solution. Crystallographic analysis of ecGlK revealed a dimer within the asymmetric unit. The association of ecGlK monomers to form the dimer structure (Fig. 2) occurs through interactions between the large domains of each monomer such that both active site clefts are solvent accessible. Secondary structure elements contributing to the dimer interface include helix α4 and adjacent loops, the C-terminal tip of helix α7, strand β10 and the loop connecting this to strand β11 (Fig. 2). The total buried surface area upon dimer formation is ~3,060 Å² for both monomers, equivalent to ~10% of the accessible surface area of each monomer. Generally, ATP-dependent GlKs of bacterial or archaeal origin are dimeric enzymes, although the GlK from the archaeon Aeropyrum pernix is monomeric (24, 58), as is human glucokinase (36).
Many hydrogen bonds and van der Waals contacts are formed between the two monomers (chains A and B) of ecGlK. Hydrogen bonds are formed between the side chains of Arg150(A) and Asp148(B), as well as Asp162(A) and Lys284(B). There are also backbone H-bonds between Leu250(A) and Glu157(B), as well as numerous water-mediated hydrogen bonds. Contacts between the two monomers are also provided through stacking interactions between the side chains of Phe287 of one monomer and His160 of the other. As described in the following, Glu157 and His160 are also part of the glucose-binding site.
A recent exhaustive analysis of over 17,000 sequences of kinases and their relationship to structure (16) classified ecGlK (COG0837) within the RNase H-like kinase group, with representative structures from hexokinase (4), glycerol kinase (32), and acetate kinase (15). These kinases are also members of the sugar kinase-heat shock protein 70-actin superfamily (12, 31) and are classified within the actin-like ATPase superfamily within SCOP (5). An alignment of selected GlK protein sequences is shown in Fig. 3.
A search for similar structures by using the DALI server (29; http://www.ebi.ac.uk/dali/) found that the most similar structures were human brain hexokinase I (PDB code 1QHA) (57) and hexokinase B (also known as hexokinase PII) from S. cerevisiae (PDB 2YHX) (4). Yeast hexokinase PII is somewhat larger than ecGlK, comprising 486 residues (38), while human hexokinase type I is much larger, consisting of two ~50-kDa chains (72). Each chain contains two globular units, an N-terminal regulatory domain (residues 1 to 474) and a C-terminal catalytic domain (residues 475 to 917) separated by a long α-helical linker (1). Each domain of human hexokinase I is structurally similar to the yeast hexokinase monomer (1) and to monomeric human glucokinase (36). Superposition of ecGlK with the human brain hexokinase I catalytic domain gave a root mean square deviation (rmsd) of 1.66 Å for 157 Cα atoms and 1.88 Å for 188 Cα atoms for human glucokinase, while a similar superposition between ecGlK and yeast hexokinase PII gave an rmsd of 1.77 Å for 149 Cα atoms. A redetermination of the yeast hexokinase PII structure reported by Anderson et al. (4) with the correct amino acid sequence (PDB 1IG8) (38) reveals a very similar fold for the two yeast PII hexokinase structures.
A superposition of ecGlK and yeast and human hexokinases is shown in Fig. 4. Structural similarities are most pronounced in the core regions of the structures. This structural similarity exists despite low (~16 to 18%) sequence identity between ecGlK and the two hexokinases. Bacterial glucokinases, of which ecGlK is a member, along with yeast and human hexokinases had previously been identified as members of the “hexokinase family,” sharing several short sequence motifs and predicted to have similar folds (13). Comparison of the structures of ecGlK and human hexokinase I shows that the fold of the small domain is very similar, except for the absence of the β-hairpin (β5-β6) in human hexokinase I (Fig. 5). There are greater differences between the large domains (Fig. 5). The core mixed β-sheet (β13-β12-β9-β10-β11) is preserved in both hexokinase and ecGlK, although there is one extra β-strand (β8) at the C-terminal side of the sheet in ecGlK and one extra β-strand (β1) which comes from the N-terminal segment at the opposite side of the sheet in hexokinase (Fig. 5). Absent from ecGlK are specific structural features of eukaryotic hexokinases, including an N-terminal mitochondrial membrane-targeting sequence (46), and distinct, specific binding sites for the allosteric inhibitor glucose-6-phosphate (1) or nucleotide related to dissociation of hexokinase from the membrane (57). Evidently, yeast and human hexokinases as well as ecGlK evolved from a common ancestor, retaining similar overall structures while diverging in sequence.
A comparison of the structures of ecGlK and the ADP-dependent GlKs revealed no significant structural similarity between these two groups of glucokinases. Both enzymes consist of a small and a large domain, with the active site cleft between the domains. The folds of both small and large domains differ in ecGlK and the ADP-dependent GlKs. This result is consistent with the idea that ADP-dependent GlKs adopt a ribokinase-like fold (16, 33).
In rat hexokinase (46) and human hexokinase I (1), the dimer is formed through the association of N- and C-terminal domains from the two respective chains, yielding a head-to-tail arrangement. Dimerization is not essential for human hexokinase I in vitro, as monomeric enzyme retains activity (72). A comparison of the dimerization interfaces of ecGlK with the interface of human hexokinase I (PDB 1QHA) reveals that opposite faces of the respective monomers are associated with the dimer interface in these two enzymes, although the monomers themselves are structurally similar (Fig. 4). Similarly, the region of ecGlK involved in dimerization is not structurally conserved in the S. cerevisiae hexokinase PII (PDB 2YHX) structure (4). The monomer-dimer equilibrium of yeast PII hexokinase is influenced by pH, ionic strength, glucose concentration, and phosphorylation at Ser14 (8). Whether yeast hexokinase PII functions as a monomer or dimer in vivo is unclear.
A further difference between ecGlK and the ADP glucokinases is at the level of quaternary structure. The ADP-dependent GlK from P. furiosus does appear to be a dimer both in solution and in the crystal, with a disulfide bond between the side chains of Cys94 (34). The nearly identical enzyme activity of the C94S mutant GlK, which does not dimerize, compared with that of the native enzyme, as well as the lack of sequence conservation of this Cys residue in other GlKs suggests that the covalently linked dimer is not the physiologically relevant structure of this enzyme (34). The two ADP-dependent GlKs structurally characterized from P. horikoshii (70) and T. litoralis (33) are monomeric enzymes.
Crystals of apo-ecGlK soaked in reservoir solution containing low concentrations of glucose immediately cracked and dissolved, prohibiting structure determination. Cocrystallization experiments of ecGlK in the presence of glucose yielded a new crystal form in space group P21. The structure of the ecGlK-glc complex was determined by molecular replacement by using the native structure as the search model and was refined to an R factor of 0.193 (Rfree = 0.265) at a resolution of 2.2 Å.
Inspection of the initial Fo-Fc difference map in the active site region revealed the presence of density corresponding to bound glucose in both ecGlK monomers (Fig. 6a). The bound glucose molecule is in a chair conformation and adopts the β-anomeric configuration. Both glucose molecules are well ordered with low B-factors, indicating good occupancy for their respective binding sites.
Each glucose molecule participates in an extensive hydrogen bonding network within the active site pocket (Table 2 and Fig. 6a). All of the interacting residues are highly conserved within the related sequences of group II GlKs (PFAM accession number P46880) (Fig. 3). The residues Asn99, Asp100, Glu157, and Glu187 are also conserved both in sequence (Fig. 3) and structurally (Fig. 6b) in human hexokinase I. In human hexokinase I (PDB 1DGK), structurally equivalent residues to those of ecGlK (in parentheses) are Glu708 (Glu157), Gln739 (His160), Glu742 (Glu187), Asn656 (Asn99), and Asp657 (Asp100). Site-specific mutations of Asp657, Glu708, and Glu742 of human hexokinase I have been previously shown to abolish activity in vitro (6). The Glu708Ala and Glu742Ala mutations reduced the KM for glucose by 50- and 14-fold, respectively (6). A water-mediated hydrogen bond between the O6 atom of glucose and the amide N of Gly138 is also present in human hexokinase I.
The cracking and dissolving of apo-ecGlK crystals when soaked in the presence of glucose was suggestive of ligand-induced conformational changes in the enzyme, analogous to those found initially in yeast hexokinase (9, 65) and subsequently in human hexokinase I (2) and ADP-dependent GlKs (34). Superposition of yeast hexokinase and its complex with glucose revealed closing of the domains relative to one another, effectively burying the substrate (65). A similar finding has been observed with P. furiosus GlK in the presence of bound glucose (34), where comparison of this structure with the related apo-GlK from T. litoralis showed a maximal shift in Cα positions of 12 Å at the tip of the small domain.
The dimer interfaces for ecGlK and ecGlK-glc are very similar but not identical. In ecGlK-glc, the H bond between Asp148OD1(A)-Arg150NH1(B) is not maintained. Dimers of ecGlK and ecGlK-glc were superimposed by using Cα atoms of the large domains (residues 120 to 300) of both molecules of the dimer, giving an rmsd of 0.42 Å for 372 Cα atoms. While the large domains are fixed through their interactions in the dimer, the small domains in both monomers are rotated by ~15°, resulting in a maximum Cα displacement of ~10 Å (Fig. 7a). As a consequence of this movement, the active site cleft becomes more closed.
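The relation between the ~15° rotation and the ~10-Å maximal Cα displacement can be illustrated with simple rigid-body geometry: a point at distance r from a fixed rotation axis moves through a chord of length 2r·sin(θ/2). The sketch below is an illustration only, assuming an idealized rotation about a single fixed hinge axis, which the structures need not obey exactly; the function names are hypothetical.

```python
import math

def chord_displacement(radius, angle_deg):
    """Straight-line displacement of a point rotated by angle_deg about an axis
    at the given perpendicular distance (same length units as radius)."""
    return 2.0 * radius * math.sin(math.radians(angle_deg) / 2.0)

def hinge_radius(displacement, angle_deg):
    """Distance from the rotation axis implied by a displacement and an angle."""
    return displacement / (2.0 * math.sin(math.radians(angle_deg) / 2.0))

# Values quoted in the text: ~15 degree rotation, ~10 A maximal shift
radius = hinge_radius(10.0, 15.0)
print(f"implied distance from hinge axis: {radius:.1f} A")
```

Under this idealization, the most displaced Cα atom would lie a little under 40 Å from the hinge axis, which is in keeping with the largest shifts occurring at the tip of the small domain, far from the interdomain connection.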
Comparison of the overall structures of the two monomers of the apo-ecGlK dimer reveals that a few loops within the small domain have different conformations in the two monomers. Superposition of the two apo-ecGlK monomers by using only the large domains reveals the intrinsic conformational flexibility of this enzyme (Fig. 7b), as has been previously observed with yeast hexokinase in solution studies (55). The maximal Cα displacement for residues in the small domain in this superposition is for Thr78, displaced by 7 Å, part of the loop which closes the glucose-binding site. Superposition of the two monomers gave an rmsd of 1.3 Å for 320 Cα atoms, while superposition of Cα atoms of the large domain alone gave an rmsd of 0.38 Å for 186 Cα atoms. In ecGlK-glc, glucose binding stabilizes these flexible loops, resulting in their adopting the same conformation in both monomers. These observations imply that while glucose binding stabilizes the closed form, domain-domain movements occur in apo-ecGlK independently of glucose binding and reflect the intrinsic flexibility of the enzyme.
The only structural change observed within the small domains themselves upon glucose binding is a movement of the loop consisting of residues 73 to 79. Between large and small domains, a number of hydrogen bonds are broken as a result of domain closure including those between Glu315NE2 and Trp131 O, Glu315OE1 and Trp151 N, and Asn303OD1 and Arg16NH1 as well as Thr32OG1 and Arg16NE. New hydrogen bonds formed after domain closure include those between Asn303OD and Arg16NE and between Thr32OG and Arg16NH1. Several van der Waals contacts are also broken and reformed as a consequence of domain movement. Of those residues involved in glucose binding, only Asn99 and Asp100 undergo significant movement in comparison to apo-ecGlK and glucose-bound ecGlK (Fig. 7c).
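The two quantities used in this comparison, the rmsd over matched Cα atoms and the largest single Cα displacement, can be computed as sketched below once the structures have been placed in a common frame. This is a minimal illustration, not the procedure actually used: it assumes the coordinates have already been superimposed on the large domain by a separate fitting step (not shown), and the coordinates in the example are invented.

```python
import math

def distance(p, q):
    """Euclidean distance between two (x, y, z) points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def rmsd(coords_a, coords_b):
    """Root mean square deviation over matched atom pairs (no fitting done)."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    squared = [distance(p, q) ** 2 for p, q in zip(coords_a, coords_b)]
    return math.sqrt(sum(squared) / len(squared))

def max_displacement(coords_a, coords_b):
    """Return (index, distance) of the most displaced matched atom pair."""
    shifts = [distance(p, q) for p, q in zip(coords_a, coords_b)]
    index = max(range(len(shifts)), key=shifts.__getitem__)
    return index, shifts[index]

# Hypothetical, pre-superimposed C-alpha coordinates (A) for three residues
monomer_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
monomer_b = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 7.0)]

overall = rmsd(monomer_a, monomer_b)
worst_index, worst_shift = max_displacement(monomer_a, monomer_b)
```

Restricting the fit to the large domain while reporting displacements for the small domain is what separates interdomain movement from overall fitting error.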
Although we could not obtain a complex between ecGlK and ADP/Mg2+, either in the presence or absence of glucose, a comparison of ecGlK with the structures of mutant human hexokinase I bound to ADP-glucose (3) or yeast hexokinase PII bound to sulfate (38) offers insights into the likely ATP-binding site of ecGlK. The ADP-binding site of a quadruple mutant of hexokinase I has been determined (PDB 1DGK) (3). Indeed, the nucleotide-binding site is structurally conserved in all members of this superfamily, as predicted by Bork et al. (12).
First, the positions of the sulfate anion of apo-yeast hexokinase PII and Pα of ADP from the human hexokinase binary complex superimpose, with a 0.65-Å distance between the P and S atoms, respectively. This position has been identified as a high-affinity anion binding site in a number of hexokinase structures (10, 46). In the case of human hexokinase I, the α-phosphoryl group makes hydrogen bonds with Thr680OG1 and Thr680N as well as with Thr863N. In yeast hexokinase PII, Ser419OG1, the structural equivalent of Thr863, forms a hydrogen bond with an O atom of the sulfate anion. Other hydrogen bonds with the sulfate are formed with Ser419N, Thr234OG1, and Thr234N. This last residue is the yeast equivalent of Thr680 of human hexokinase. Superposition of these models with ecGlK reveals that Thr137 of ecGlK is structurally equivalent to Thr234 of yeast hexokinase and Thr680 of human hexokinase. In addition, Thr137N could participate in a hydrogen bond with the α-phosphoryl group, analogous to that of Thr234N and Thr680N of yeast and human hexokinases, respectively.
As with other kinases, a metal ion, such as Mg2+ or Mn2+, is expected to be an essential component of the catalytic machinery (40). No cocrystal structure of hexokinase or glucokinase with bound Mg2+ has yet been reported. The side chains of Thr680 (Thr137 of ecGlK), Asp532 (Asp9 of ecGlK) or Asp861 (HK1; PDB 1DGK) appear proximal enough to the phosphoryl groups of the nucleotide binding site that they could participate in Mg2+ binding. Both Thr137 and especially Asp9 are highly conserved in the sequences of group II GlKs (Fig. 3). A combination of modeling (3) and electron paramagnetic resonance studies in solution (48) suggests that Mn2+ or Mg2+ may only form water-mediated interactions with the enzyme. Consistent with the importance of Asp532, the mutations Asp532Lys and Asp532Glu have been shown to decrease kcat of human hexokinase I by 1,000- and 200-fold, respectively (74). A hydrated Mg2+ binding site has also been suggested for the P. furiosus ADP-dependent glucokinase (34).
In the human hexokinase I complex, key residues interacting with ADP include Thr680OG1 (Pα), Thr683OG1 (Pβ and O5′ of ribose), and Asn537ND2 (Pβ). Asn537 is part of a loop, conserved in sequence between ecGlK (Asn14) and yeast hexokinase (Asn91), and is associated with nucleotide binding. A Thr680Val mutant of hexokinase I showed a decrease in kcat of ~2,000-fold, while the Thr680Ser mutant only decreased ~2.5-fold, showing the importance of this hydrogen bond in catalysis (74).
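To put these fold changes on an energy scale, a decrease in kcat can be converted to an apparent change in activation free energy by ΔΔG‡ = RT·ln(fold change). The sketch below is an interpretive aid rather than part of the cited work; it assumes simple transition-state theory with an unchanged pre-exponential factor and a temperature of 298.15 K, neither of which is stated in the source.

```python
import math

R_KCAL_PER_MOL_K = 1.987e-3  # gas constant in kcal/(mol*K)

def ddg_from_fold_change(fold, temperature_k=298.15):
    """Apparent change in activation free energy (kcal/mol) for a given
    fold decrease in kcat, assuming simple transition-state theory."""
    if fold <= 0:
        raise ValueError("fold change must be positive")
    return R_KCAL_PER_MOL_K * temperature_k * math.log(fold)

# Fold decreases in kcat quoted in the text for human hexokinase I mutants
ddg_t680v = ddg_from_fold_change(2000.0)  # Thr680Val
ddg_t680s = ddg_from_fold_change(2.5)     # Thr680Ser

print(f"Thr680Val: {ddg_t680v:.1f} kcal/mol, Thr680Ser: {ddg_t680s:.1f} kcal/mol")
```

On this estimate, the Val substitution costs roughly 4.5 kcal/mol whereas the Ser substitution costs about 0.5 kcal/mol, a difference of the order expected for the loss of a hydrogen bond that Ser, but not Val, can still provide.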
A conserved sequence and structural motif consisting of two β-strands connected by a loop within the small domain (residues Leu6-Leu19 of ecGlK) (Fig. 3) has the sequence L-(A/V)-X-D-X-G-G-T-N-X-R-X-X-L (conserved Asp and Gly residues are in boldface) and is proximal to the ATP phosphoryl group binding site. In particular, the residues equivalent to Leu6, Asp9, Gly11, Gly12, Asn14, Arg16, and Leu19 are completely conserved in the sequence alignment of ecGlK, human hexokinase, and yeast hexokinase as well as human glucokinase (Fig. 3). A similar motif [X2-D-(I/L/V)-G-G-(S/T-X3); conserved Asp and Gly residues are in boldface] is conserved as well in group III (ROK) glucokinases (30, 45) and is equivalent to a portion of the phosphate 1 motif identified originally by Bork et al. (13). The two conserved Gly residues contribute to formation of the loop and could form main chain hydrogen bonds with the ATP phosphates. Indeed, the mutation Gly534Ala of hexokinase I (Gly11 of ecGlK) results in a decrease of kcat by 4,000-fold (75). In the human hexokinase I-ADP-glucose complex, this loop has adopted the most closed conformation and makes direct hydrogen bonds between the phosphate O atom bridging the α and β phosphoryl groups and Ala536N, as well as between a Pβ O atom and Asn537N (Fig. 8). Importantly, in the human hexokinase I binary complex, Thr536 has been mutated to Ala, yet kcat/KM for neither glucose nor ATP is significantly perturbed (3). This argues that the backbone conformation of this loop is important for nucleotide binding, rather than the presence of the Thr536 side chain specifically, although modeling suggests that Thr536OG can evidently form a hydrogen bond with the Pα-Pβ bridging O (data not shown). Asn537ND2 also forms a hydrogen bond with an O atom of Pβ. We suggest that similar interactions would be found in yeast hexokinase and ecGlK in the presence of bound nucleotide and glucose.
It is evident that a secondary conformational change of this conserved strand-loop-strand motif must occur upon nucleotide binding, resulting in further domain closure, in addition to that which occurs upon glucose binding (Fig. 8).
The expected chemical mechanism of catalysis for ecGlK, analogous to that of other kinases, is SN2 nucleophilic attack of the O6 atom of glucose on the electropositive P atom of the γ-phosphoryl group of ATP. Initial abstraction of the proton from the O6 hydroxyl of the CH2OH group is presumably performed by Asp100 acting as a general base. Asp100OD2 is positioned 2.7 Å from O6 and is well oriented to fulfill this role (Fig. 6a). The Asp100 side chain position is anchored by hydrogen bonds to O4 of glucose and Asn99OD1, a highly conserved residue (Fig. 3). This mechanism is consistent with the complete conservation of Asp100 in group II glucokinase sequences and in human hexokinase (Asp657 of human hexokinase I) and glucokinase. Of the residues involved in glucose binding, the mutant Asp657Ala showed the largest effect on activity, resulting in a reduction in kcat of 100-fold relative to wild-type enzyme (6).
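The motif notation used above maps directly onto a regular expression, which is one way such a pattern could be searched for in other sequences. The following is a minimal sketch under stated assumptions: the helper name `motif_to_regex` is hypothetical, X is taken to mean any single residue, and the test sequence is an invented fragment constructed to satisfy the motif, not a sequence taken from the alignment in Fig. 3.

```python
import re

def motif_to_regex(motif):
    """Translate a dash-separated motif such as 'L-(A/V)-X-D' into a regex.

    'X' becomes '.', '(A/V)' becomes '[AV]', single letters are kept as-is.
    """
    parts = []
    for token in motif.split("-"):
        token = token.strip()
        if token == "X":
            parts.append(".")
        elif token.startswith("(") and token.endswith(")"):
            parts.append("[" + token[1:-1].replace("/", "") + "]")
        elif len(token) == 1 and token.isalpha():
            parts.append(token)
        else:
            raise ValueError(f"unrecognized motif token: {token!r}")
    return "".join(parts)

GROUP_II_MOTIF = "L-(A/V)-X-D-X-G-G-T-N-X-R-X-X-L"
pattern = re.compile(motif_to_regex(GROUP_II_MOTIF))

# Hypothetical sequence fragment, made up to contain one copy of the motif
sequence = "MSKYALVGDVGGTNARFALCD"
match = pattern.search(sequence)
if match:
    # Report 1-based residue numbers, as used in the text
    print(match.group(), match.start() + 1, match.end())
```

A strict pattern of this kind will miss homologs with conservative substitutions at the fixed positions, so in practice a profile-based search would be more sensitive; the regular expression is useful mainly for quick checks.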
In the ADP-dependent glucokinases, the residue Asp451 of T. litoralis GlK is predicted to function as a general base. This residue interacts with the O6 atom of glucose and when mutated to Ala shows a specific activity of <0.001% compared to wild-type enzyme (34). A structurally equivalent residue, Asp440 of P. furiosus GlK, is predicted to function as a general base in this enzyme (34).
In human hexokinase I, a second residue, Lys621, is also within hydrogen bonding distance of Glc O6 and has been suggested as a possible catalytic residue (46). However, there is no structural equivalent of the Lys621 residue in ecGlK. Modeling of ATP bound to ecGlK, based on the superposition with ADP-bound human hexokinase I, positions the O6 atom of glucose to within a suitable distance of the γ-phosphoryl group for in-line nucleophilic attack. The distance between O6 of glucose and the Pα atom of mutant human hexokinase I (PDB 1DGK) is 5.6 Å. To accomplish the correct orientation of the γ-phosphoryl group, the ATP β- and γ-phosphoryl groups would need to adopt an extended conformation. No specific residue required to function as a general acid, responsible for protonating ADP as the leaving group, has been identified in hexokinase, although the possibility that it arises from a water molecule coordinated to Mg2+ has been suggested (3).
The kinetic mechanism of several glucokinases has been investigated and found to have a preferred order of substrate addition and product release. In the ATP-dependent glucokinases from Zymomonas mobilis (62) and Propionibacterium shermanii (37) and in rat liver hexokinase IV (glucokinase) (44), the preferred order of substrate addition is glucose (or 2-deoxyglucose) followed by ATP or Mg2+. This kinetic mechanism is consistent with the ecGlK crystal structures, in that glucose binding stabilizes a closed form of ecGlK that can bind ATP, in turn resulting in a small but important conformational change necessary to form a catalytically competent form of the enzyme.
We thank Leon Flaks (NSLS; beamline X8C) and Michael Becker (NSLS; beamline X25) for assistance in data collection, Stephane Raymond for maintenance of the computing environment, and Frederic Ouellet and J. Sivaraman for assistance in protein purification and crystallization.
This research was supported in part by the Canadian Institutes of Health Research grant 200103GSP-90094-GMX-CFAA-19924 to M.C.