|Home | About | Journals | Submit | Contact Us | Français|
Restriction endonuclease MvaI recognizes the sequence CC/WGG (W stands for A or T, ‘/’ designates the cleavage site) and generates products with single nucleotide 5′-overhangs. The enzyme has been noted for its tolerance towards DNA modifications. Here, we report a biochemical characterization and crystal structures of MvaI in an apo-form and in a complex with target DNA at 1.5Å resolution. Our results show that MvaI is a monomer and recognizes its pseudosymmetric target sequence asymmetrically. The enzyme consists of two lobes. The catalytic lobe anchors the active site residues Glu36, Asp50, Glu55 and Lys57 and contacts the bases from the minor grove side. The recognition lobe mediates all major grove interactions with the bases. The enzyme in the crystal is bound to the strand with T at the center of the recognition sequence. The crystal structure with calcium ions and DNA mimics the prereactive state. MvaI shows structural similarities to BcnI, which cleaves the related sequence CC/SGG and to MutH enzyme, which is a component of the DNA repair machinery, and nicks one DNA strand instead of making a double-strand break.
Restriction endonucleases (REases) are usually classified according to their specificity, target site symmetry, oligomeric structure and/or the presence of auxiliary domains that modulate the activity (1). A less widely used criterion for REase classification is the length of sticky ends in cleaved DNA duplexes. Grouping REases by this criterion reveals a very uneven distribution in the frequency of different cleavage patterns. Indeed, 4 nucleotide (nt) 5′-overhangs are most abundant. Moreover, protruding 5′-ends are more common than recessed 5′-ends and overhangs with an even number of nucleotides outnumber those with an odd number (2). It is currently thought that different cleavage patterns are mainly achieved by the changes at the dimer interface between the conserved monomers of restriction enzymes (3,4). However, the crystal structure of Ecl18kI, which cleaves the CCNGG sequence and generates 5nt 5′-overhangs demonstrated that Ecl18kI achieves its specificity by a flip of the central nucleotides rather than by a rearrangement of the dimer interface (5). This finding suggests that the cleavage pattern may reflect fundamental structural differences and prompts further studies of endonucleases that generate rare cleavage patterns. Here, we focus on MvaI, the first example of a restriction enzyme that recognizes a pseudopalindromic sequence CC/WGG (W stands for A or T) and generates 1nt 5′-overhangs upon cleavage.
The MvaI REase is a part of the MvaI restriction–modification (RM) system discovered in Micrococcus varians strain RFL19 (now renamed to Kocuria varians). The MvaI REase is accompanied by the cognate methyltransferase (MTase) which methylates the N4-amino group of the inner cytosine residue making the host DNA resistant to MvaI cleavage (6). The modification of the same cytosine to 5mC, which can be catalyzed by several physiologically unrelated MTases, is not protective against MvaI cleavage (7).
The MvaI REase is unusual in many respects. (a) The enzyme generates 1nt 5′-overhangs (6). (b) MvaI cleaves unmodified DNA duplexes in two single-strand scissions (8). (c) The cleavage rates of the two strands are significantly different (9). (d) Unlike the prototype EcoRII REase, MvaI is very tolerant of modifications in synthetic substrates (10). The enzyme completes the cleavage of substrates with a nick in one strand, irrespective of whether or not a phosphate is present at the nicking site (8). Moreover, selective modification of one DNA strand typically affects the cleavage of either the modified or the unmodified strand, depending on the nature of the modification, but only very drastic modifications impair cleavage of both strands (7,10).
Here, we show that MvaI is a monomer in solution that binds duplex DNA in 1:1 stoichiometric ratio and we present a crystallographic analysis of MvaI in the apo- and DNA-bound forms both at 1.5Å resolution. The crystallographic results show that MvaI interacts with a complete recognition site and not just with a half-site of the pseudosymmetric target sequence. Therefore, we suggest that the enzyme acts as a ‘nickase’ initially, but may swing around its DNA target to make a second cut that completes the cleavage reaction.
Escherichia coli strain ER2566 (F–λ–fhuA2 [lon] ompT lacZ::T7 gene1gal sulA11 Δ(mcrC-mrr)114::IS10R(mcr-73::miniTn10–TetS)2R(zgb-210::Tn10)(TetS) endA1 [dcm]) was used as a host for cloning and overexpression experiments. ER2566 strain was shown experimentally to exhibit the dcm− phenotype.
The MvaI REase (mvaIR) gene was amplified from the cloning vector kindly provided by Fermentas (Vilnius, Lithuania). The mvaIR gene was re-cloned into the pBAD24 (Apr) vector and expressed in the ER2566 strain bearing plasmid pACYC184 (Cmr) containing pspGIM gene which was a gift from New England Biolabs (USA). The expression vector for MvaI REase encoded full-length MvaI with three additional MKS amino acids at the N-terminus. To express MvaI REase the ER2566 strain containing the pACYC184_M.PspGI was transformed with pBAD24_R.MvaI. Cells were grown in LB medium with appropriate antibiotics at 37°C to OD600 0.7 and MvaI REase expression induced by addition of arabinose to the final 0.2% concentration. After 4h of induction the cells were harvested by centrifugation and the pellet was stored at −20°C.
Frozen cells were thawed and resuspended in 4 volumes of buffer A (0.01M Tris-HCl, pH 8.0at 25°C, 1mM EDTA, 7mM 2-mercaptoethanol) with 0.1M NaCl. The cell suspension was sonicated and the cell debris was removed by centrifugation at 18000 r.p.m for 1h. The supernatant was loaded on phosphocellulose P11 column (Whatman) equilibrated with buffer A with 0.1M NaCl. The MvaI protein was not bound to P11 beads, the flow-through was collected and subjected to subsequent chromatography on the Heparin Sepharose, Blue Sepharose (Amersham Biosciences) and phosphocellulose P11 columns (Whatman). The protein was eluted from all columns with a linear NaCl gradient in buffer A. All purification steps were monitored by λ DNA cleavage and SDS-PAGE. Fractions containing MvaI REase activity were pooled and dialyzed against storage buffer (0.01M Tris-HCl, pH 8.0 at 25°C, 0.4M KCl, 1mM EDTA, 1mM DTT, 50% glycerol) and stored at −20°C. The protein was purified to electrophoretic homogeneity in a Coomassie-blue-stained SDS-gel. The overall yield of MvaI purification was 10mg from 5l culture.
Analytical ultracentrifugation was done in an An-60 rotor with a Coulter-Beckman Model XL-A analytical ultracentrifuge equipped with UV absorption detection using charcoal filled epon centrepieces. Sedimentation velocity was measured at 4°C and 44000 r.p.m. in a buffer containing 0.01M Tris-HCl, pH 7.4, 0.4M KCl, 0.1mM DTT and 0.1mM EDTA at a loading concentration of 2.5µmol. Absorption profiles were recorded at 230nm to increase the sensitivity of detection. The sedimentation coefficient distribution was analyzed with the program SEDFIT (11). Viscosity and density correction to calculate s20°C,W was done using the data supplied by the program package SEDNTERP (http://www.rasmb.bbri.org). Sedimentation equilibrium was measured in six-channel centrepieces at 18000 r.p.m. and 20°C. Samples were spun until no change in absorbance profiles could be observed for at least 12h at which time equilibrium was assumed to have been reached. Molar masses were evaluated from the concentration gradients observed in these last 12h as described earlier (12).
Here, 30μg (1nmol) of MvaI protein was mixed with blunt-ended 9-mer cognate oligoduplex 1 (Figure 1A) in buffer B (0.01M Tris-HCl, pH 8.0, 0.3M KCl, 5mM CaCl2, 1mM DTT, 5% glycerol) at different stoichiometric ratios, and incubated on ice for 1h. The samples were loaded on a Superose ™ 12 HR 10/30 column (Amersham Biosciences), which was equilibrated in buffer B and run with a flow rate of 0.4ml/min. The column was calibrated with Biorad protein standards (vitamin B-12, 1.35kDa; myoglobin, 17kDa; ovalbumin, 44kDa; IgG, 150kDa and thyroglobin, 670kDa). For the interpolation of unknown molecular mass, a linear dependence of the logarithm of the molecular mass on the elution time was assumed.
Elution profiles were monitored by an Ettan (Amersham) two-wavelength detector at 260 and 280nm. The A(260):A(280) absorbance ratios necessary for profile deconvolution were deduced from the ratios of A(260) and A(280) peak heights after injection of only protein or only DNA. For our system, we determined A(260):A(280) ~1.9 for DNA and A(260):A(280) ~0.56 for MvaI without DNA. Absolute absorbance values were calculated as follows: for the double-stranded DNA, an A(260) = 1cm−1 was taken to correspond to 0.15mM nucleotides or 8.33μM of the 9-mer oligoduplex. For MvaI apo-form, an extinction coefficient of 34 380M−1cm−1 was calculated by the EXPASY server (http://www.expasy.ch) (13). After some elementary algebraic manipulations, it follows that cprotein = −21.3A(260) + 40.8A(280) and cDNA = 11.7A(260) − 6.5A(280), if absorbances and concentrations are measured in units of cm−1 and μM, respectively.
Native gel electrophoresis was run in acidic conditions according to a protocol from Dr Lebendiker (The Hebrew University, Jerusalem, http://wolfson.huji.ac.il/purification/Protocols/PAGE_Acidic.html). Electrophoresis was performed at 4°C at 25mA. Care was taken to reverse the polarity relative to the usual arrangement, because at pH 4.3 the proteins migrate as positively charged species. Here, 5μg (0.17 nmol) of MvaI was mixed in 10μl of buffer B with oligoduplex 1 (Figure 1A) in various stoichiometric ratios. Mixtures were incubated on ice overnight, supplemented with 2μl of loading dye (1.45ml of 50% glycerol, 0.5ml of 0.25M acetate-KOH, traces of methylene green) and immediately loaded on the gel.
Crystallization was done by the vapor diffusion technique at room temperature. Initial high-throughput screens were set up at the 200-nl scale using a Cartesian pipetting robot and 96-well Greiner sitting drop plates. Crystallization trials with larger drop volumes were pipetted in CRYSCHEM plates (Hampton Research).
MvaI was dialyzed against buffer B and concentrated to 7mg/ml. Initial screening identified Hampton Research Crystal Screen 2 buffer 34 (0.05M cadmium sulfate hydrate, 0.1M HEPES pH 7.5, 1.0M sodium acetate trihydrate) as promising for crystallization. The best crystals appeared in a drop containing 1.5µl of protein, 1.5µl of buffer C (0.05M cadmium sulfate hydrate, 0.1M HEPES pH 7.74, 1.2M sodium acetate trihydrate) and 0.3µl of 1M glycine as an additive, which was equilibrated against buffer C in the reservoir. The largest crystal was flash-cryocooled in a drop containing 16µl buffer C, 2µl of 1M glycine and 2µl of (2R,3R)-(−)-2,3-butandiol. Diffraction improved after a single annealing step.
in 1:1 molar ratio and incubated overnight on ice. The protein–DNA complex (final concentration 6.4mg/ml) was applied on a 24-well plate with appropriate buffers. The crystals appeared in a drop containing 2µl of protein and DNA solution and 2µl of buffer D (0.1M HEPES pH 7.89, 0.2M CaCl2, 25% PEG4000). Crystals were flash-cryocooled in crystallization buffer D supplemented with 20% of glycerol.
All diffraction data were collected at 100K. In-house data were measured on a RUH300 generator with copper anode from MSC/Rigaku equipped with Osmic multilayer optics and an MAR345 image plate. Synchrotron data were collected at beamline BW6 at DESY, Hamburg. All data were processed with MOSFLM (14) and scaled with SCALA (15) (Table 1).
This crystal form contained two MvaI molecules in the asymmetric unit. It was solved by the SAD method, taking advantage of the presence of Cd2+ ions in the crystallization buffer, which bound to some defined sites on the protein surface. Five Cd2+ sites were identified with reasonable statistics (CCall 28.7, CCweak 17.3) by the SHELXD program (16). Initial phasing with these sites in SHELXE (17) resulted in a significant contrast difference (0.61 for the correct hand versus 0.45 for the incorrect hand assuming 50% solvent content). Optimal SAD phases were calculated by the SHARP program and then extended to the full resolution of the synchrotron dataset. The extended phases were of sufficient quality for ARP/wARP (18) to automatically build 333 amino acids in 19 chains or 67% of the total number of residues that are chemically present in the MvaI crystals. The partial model was sufficient to derive the non-crystallographic (NCS) symmetry, which could then be used to map fragments between monomers. The ‘symmetrized’ model was then used as the starting point for a further round of model building, and then polished manually.
This crystal form contained two complexes of MvaI bound to duplex DNA in the asymmetric unit. Molecular replacement was unsuccessful with the complete MvaI model from the DNA-free crystal form, but two copies of the recognition domain could be placed using the MOLREP program. The initially weak model phases were improved using the protocol for NCS-phased refinement from the CCP4i interface. The resulting phases were of sufficient quality for automatic model building using ARP/wARP (18). DNA was built manually starting from a model of B-DNA of the proper sequence generated with the modeling program 3DNA (19).
Manual corrections to the models were done with the programs O (20) and XtalView (21). All structures were refined with the maximum likelihood program REFMAC (22) treating each lobe of MvaI and each DNA duplex as separate TLS entities. Data collection and refinement statistics are summarized in Table 1.
At the sequence level, MvaI shows weak similarities to the monomeric DNA repair protein MutH (23). This result prompted us to check the oligomerization state of MvaI in solution. Analytical ultracentrifugation shows MvaI to be mostly monomeric. Sedimentation velocity runs in the analytical ultracentrifuge with MvaI gave a sedimentation constant of s20°C,W = 2.62S. Using a mass for the monomeric protein of 28.6kg/mol this corresponds to a frictional ratio of 1.27. For spherical hydrated proteins, a frictional ratio of 1.1–1.2 is expected (24) and thus MvaI can be viewed as a mostly globular, monomeric particle. Sedimentation equilibrium gave a molar mass of 35kg/mol indicating the protein to show some aggregation. This aggregation could be suppressed by the addition of 0.8M GuaHCl where a molar mass of 27.5kg/mol is observed.
The monomeric state of MvaI in the apo-form and in complex with the cognate, blunt-ended 9-bp oligoduplex 1 (Figure 1A) was independently analyzed by analytical gel filtration. Experiments were run in the presence of calcium ions, which support DNA binding, but not hydrolysis (data not shown). Although MvaI and MvaI–DNA complexes are not resolved on the column, the amounts of MvaI and DNA in the elution peaks could be separately quantified at 260 and 280nm, respectively, and deconvoluted based on the known A(260):A(280) absorbance ratios for MvaI and DNA (see Materials and methods). For consistency, the deconvolution procedure was applied to all profiles, even if only protein or only DNA was injected (Figure 1B–F).
In the experimental conditions (see Materials and methods), MvaI alone elutes from the column at 13.82ml, which translates into a molecular mass of 30 ± 3kDa, in agreement with the calculated monomer mass 28.6kDa (Figure 1B,G). The cognate 9-bp oligoduplex 1 (Figure 1A) alone elutes much later, at 15.47ml (Figure 1C), but it coelutes with MvaI up to a stoichiometric ratio of 1 oligoduplex per 1 MvaI monomer (Figure 1D and E). If oligoduplex is present in excess over the MvaI monomer, two peaks result. One equivalent of DNA coelutes with the protein, and the rest elutes as free DNA (Figure 1F). The retention volume for the MvaI–DNA complex is 14.01ml and corresponds to an apparent molecular mass value of 26 ± 2kDa (Figure 1G) which is slightly lower than the apparent mass of MvaI alone, probably because the more compact shape of the complex nearly balances the increase in molecular mass due to DNA binding. We conclude that MvaI remains monomeric in the presence of cognate DNA, at least in our experimental conditions. Our findings are inconsistent with a prior report of MvaI dimerization in the presence of both cognate and non-cognate DNA (25).
The gel filtration experiments suggest that MvaI is a monomer that binds target duplex DNA in 1:1 stoichiometric ratio. This was independently confirmed by gel electrophoresis under acidic conditions (see, Materials and methods). In these conditions, free DNA does not enter the gel, but MvaI alone (predicted isoelectric point 6.8) migrates into the gel (Supplementary Figure 1, lane 1). The addition of substoichiometric amounts of cognate oligoduplex leads to the appearance of a second, slower migrating band, the MvaI–DNA complex (Supplementary Figure 1, lane 2). If oligoduplex is added in stoichiometric ratio, the band from MvaI alone almost disappears, and essentially only the MvaI–DNA band is present (Supplementary Figure 1, lane 3). Still further increase of the DNA concentration has no effect on the band pattern (Supplementary Figure 1, lane 4).
The available biochemical data on MvaI and our findings about its oligomeric state and DNA-binding stoichiometry suggested that MvaI is a highly unusual restriction enzyme and prompted us to determine its structure. MvaI was crystallized in the apo-form and in the presence of oligoduplex 2 (Figure 2). The crystals with and without DNA happened to be monoclinic and diffract to 1.5Å resolution. The apo-form was solved by the SAD method, and the form with DNA by molecular replacement, using the previously determined model of the DNA-free form (details in Materials and methods and Table 1).
Both crystal forms of MvaI contained two monomers in the asymmetric unit. In the apo-structure, the interaction of the monomers, which are related by a curious 2-fold symmetry, buries ~1800Å2 of solvent-exposed surface. Despite this extensive contact, the interaction cannot be physiological, because (a) it is not observed in solution, (b) the interface is not conserved and (c) it locks MvaI in an open conformation that is not compatible with DNA binding (see below). In the crystals of the MvaI–DNA complex, the largest interfaces between adjacent molecules in the crystal bury only ~900 and ~600Å2. Moreover, these contacts relate molecules that cannot be mapped on each other by a simple 2-fold rotation, and they differ from the extensive contact in the apo-MvaI structure. Therefore, we conclude that MvaI crystallized as a monomer in all cases. The two molecules in the apo-MvaI structure are very similar, probably because the local 2-fold axis enforces it. Likewise, the two MvaI molecules in the asymmetric unit of the crystals with DNA overlap almost perfectly, probably because the complexes with DNA are very compact and therefore rigid. The same is true for the bound DNA duplexes, which can be described by very similar conformational parameters according to the 3DNA program (19) (Supplementary Table 1), even though no restraints or constraints were applied during refinement. For simplicity, we will not distinguish between the two monomers in each crystal form in the following text.
MvaI is organized into two lobes which we term the catalytic lobe (residues 1–63, 160–188 and 238–246, orange in Figure 3) and the recognition lobe (residues 64–159 and residues 189–237, green in Figure 3) based on their mechanistic roles discussed below. The orientation of the two lobes differs radically depending on whether or not DNA is bound: in the absence of DNA, the molecule has an almost flat appearance (Figure 3A), but in the presence of DNA, it forms a tight ‘clamp’ around it (Figure 3B). Despite the drastic change in hinge angle (~54°), there is no major conformational difference within the lobes, except in loops (Figure 3C and D and Supplementary Figure 2). The rearrangement of residues 40–57 is significant, because this region includes catalytic residues.
The catalytic lobe is organized around a four-stranded mixed β-sheet which is flanked by two α-helices and a 310-helix (orange part of Figure 4). The topology of the sheet is +1x, +1, +1, +1 according to the Richardson nomenclature (26). Note that elements of the recognition lobe are interspersed between strands βC1 and βC2 and also between strands βC3 and βC4. Strands βC2 and βC3 are connected by a simple hairpin. The fold of the catalytic lobe is fairly similar to the fold of its counterpart in MutH, which has been termed the ‘N-arm’ of this enzyme (27). There are also other, more distant similarities to the cores of other REases of the PD … EXK family (data not shown).
The term ‘catalytic’ lobe was chosen because this part of MvaI anchors all catalytic residues of the enzyme. In addition, it also contributes some of the minor grove interactions with DNA, which are unlikely to play a major role in sequence discrimination (see below).
In contrast to the catalytic lobe, the recognition lobe is characteristic of MvaI. The fold is organized around two antiparallel β-sheets (green part of Figure 4). The larger of the two sheets consists of strands βR1, βR2, βR3, βR7 and βR8/βR9, which are connected in +4x, −1, −1, −1 topology. This would correspond to the Greek key motif if the small βR2 is not taken into account. The smaller sheet is built from strands βR4, βR5 and βR6 and is a β-meander. Protein architecture dictates two preferred ways for β-sheets to stack against each other: aligned, with an angle between the strands in the two sheets ~30°, and perpendicular, with an angle between the strands ~90° (28,29). The MvaI recognition lobe clearly belongs to the latter group and therefore resembles a half-barrel or barrel (30,31) except for the lack of hydrogen bonds to connect the sheets.
The term ‘recognition’ lobe was chosen because this lobe anchors the residues that interact specifically with the major grove of DNA and likely mediate sequence discrimination. We also note that nearly all basic residues (Lys64, Lys72, Arg85, Lys90, His100, Arg107, Lys159, Lys205, His214) that interact with the phosphodiester backbone of DNA are located in the recognition lobe.
MvaI acts as a monomer and recognizes its pseudosymmetric target sequence asymmetrically (Figures 2 and and3B).3B). As the enzyme has only one active site, this implies that only one strand can come into proximity of the active site. Although MvaI can bind target DNA in two orientations, the strand with the central T (‘T-strand’) binds exclusively or predominantly close to the active site in our crystal form of MvaI with DNA.
As expected, MvaI contacts the specifically recognized target bases, but in addition it also engages in hydrogen bonds with a flanking base pair. In all positions, the catalytic lobe approaches the bases exclusively from the minor grove side, and the recognition lobe interacts with the bases exclusively from the major grove side. For the detailed discussion, we follow the T-strand from the −3 to the +2 position (from left to right according to the scheme in Figure 2).
A − 3 T + 3: This A-T base pair is not part of the recognition sequence, but it nevertheless makes two direct hydrogen bonds with MvaI. The O2 and O4 atoms of thymine accept hydrogen bonds from the side chain amide group of Asn45 and the OH of Tyr213, respectively. From the structural perspective, it can be expected that these interactions contribute little to sequence specificity: the position of the O2 atom, the so-called ‘outer minor grove’ is taken by a hydrogen bond acceptor for all four possible base pairs (32). On the major grove side, the side chain oxygen atom of Tyr213 can act as a hydrogen bond donor as in the crystallographically observed complex, but might also act as a hydrogen bond acceptor, if other base pairs are present in this position (Figure 5A).
C − 2 G + 2: This C-G base pair forms only indirect hydrogen bonds with MvaI on the minor grove side, but is involved in two direct hydrogen-bonding interactions with the enzyme on the major grove side. The N4 atom of cytosine donates a hydrogen bond to the main chain carbonyl oxygen atom of Asp224 and the O6 atom of guanine accepts a hydrogen bond from the Nε atom of His223. This interpretation requires that His223 is either charged or is in the tautomeric state with the proton on the N atom (Figure 5B and data not shown).
C − 1 G + 1: This C-G base pair makes an indirect hydrogen bond to Arg25 on the minor grove side and interacts with His225 on the major grove side. The main chain carbonyl oxygen atom of this residue accepts a hydrogen bond from the N4 atom of cytosine, and the Nδ atom of its imidazole ring donates a hydrogen bond to the O6 atom of guanine. For the interaction between the imidazole ring and the base to be sequence selective, the tautomerization state of His225 should be locked by interactions within MvaI, and not just by the hydrogen bond to the base. As the Nε atom of His225 interacts with a water molecule that could be either a donor or an acceptor, if and how this ‘lock’ is provided remains unclear (Figure 5C and data not shown).
T0 A0: This T-A base pair accepts a direct hydrogen bond from Thr29 Oγ to the thymine O2 atom and an indirect hydrogen bond to the adenine N3 atom. As the ‘outer minor grove’ positions are taken by hydrogen bond acceptors for all possible base pairs, this interaction probably contributes little to specificity. On the major grove side, Arg209 donates a hydrogen bond to the O4 oxygen atom of the T-strand thymine in the crystallized MvaI–DNA complex. There is no trace of the alternative binding mode, which swaps purine and pyrimidine and must occur as well in solution (Figure 5D).
G + 1 C − 1: This G-C base pair is involved in extensive hydrogen-bonding interactions. On the minor grove side, Asn28 accepts a direct hydrogen bond from the N2 atom of guanine and anchors a water molecule that donates a hydrogen bond to the O2 atom of cytosine. On the major grove side, the carboxylate of Asp207 accepts a direct and a water-mediated hydrogen bond from the cytosine N4 atom. The guanidino group of Arg209 and Thr68 Oγ atoms donate hydrogen bonds to the O6 and N7 atoms of guanine, respectively (Figure 5E).
G + 2 C − 2: This G-C base pair makes only major grove interactions with MvaI. The guanidino group of Arg230 donates hydrogen bonds to the guanine atoms O6 and N7, and the Thr102 Oγ atom accepts a hydrogen bond from the cytosine N4 atom (Figure 5F).
The methylation sensitivity of MvaI has been extensively studied with special emphasis on the differential effects of methylation on the hydrolysis of the two DNA strands. Experimentally, it was found that 5mC can replace cytosine in all positions (Supplementary Table 2), but this would have been difficult to predict from the crystallographic results. In silico introduction of 5mC instead of the outer and inner cytosines of the T-strand brings the extra methyl groups within 2.9Å of Gly226Cα and within 3.0Å of His225 O, respectively, if base positions are not adjusted and the enzyme is kept rigid. Similarly, in silico conversion of the cytosines to 5mC in the A-strand would introduce methyl groups 2.3Å away from Thr102 Oγ and 3.3Å away from His100Cβ, respectively, and would additionally require the displacement of a water molecule. Therefore, the experimentally observed tolerance of MvaI to 5mC has to be attributed either to flexibility of the enzyme, or alternatively to the adjustability of the exact positions of the bases (data not shown).
In contrast to the effects of C5 methylation, the consequences of N4 methylation on T-strand cleavage can be readily explained by the crystal structure. N4mC cannot replace the inner cytosine in either strand, because the extra methyl group would clash with a hydrogen bond acceptor of the protein (His225 O, Figure 5C and Asp207 Oδ, Figure 5E). Likewise, N4mC in the outer position of the T-strand is not tolerated, again because the methyl group would clash with a hydrogen bond acceptor on the protein (Asp224 O in this case, Figure 5B). Experimentally, substitution of the A-strand outer cytosine with N4mC has been reported not to interfere with cleavage: apparently the methyl group can displace the side chain of Thr102 (Figure 5F).
Substitution of the central adenine with N6mA interferes with A-strand cleavage, but does not affect T-strand cleavage. The latter result is consistent with the crystal structure, because the methyl group of N6mA only needs to displace a water molecule to fit in (Figure 5D).
The rules for T-strand cleavage have direct implications for A-strand cleavage, and therefore the above results can be summarized in four simple rules: (a) substitution of cytosine with 5mC has no effect. (b) Substitution of the inner cytosine with N4mC in one strand blocks cleavage of both strands. (c) Substitution of an outer cytosine in one strand by N4mC abolishes cleavage of this strand, but does not interfere with cleavage of the complementary strand. (d) Substitution of the central adenine with N6mA affects A-strand, but not T-strand cleavage. Together, these four rules correctly predict the outcome of a large number of experiments on the methylation sensitivity of MvaI (Supplementary Table 2).
Note that the MvaI MTase methylates the N4 atoms of the inner cytosines. Rule (b) predicts correctly that this modification interferes with DNA cleavage. Conversely, the Dcm MTase converts the same cytosine to 5mC, which should not have an effect on DNA cleavage. This is consistent with the experimental observation that DNA from dcm+ strains can be cleaved by MvaI (Supplementary Table 2).
MvaI crystals with and without DNA were grown in the absence of Mg2+, but in the presence of Ca2+ ions, which support DNA binding (see Figure 1), but not DNA hydrolysis (data not shown). In the DNA-free form, Cd2+ ions were present in addition to the Ca2+ ions, but no metal ions were found in the vicinity of the active site residues. This unexpected result is due to the arrangement of residues 40–57, which are present in radically different conformations in the apo-MvaI structure (Figure 6A) and in the complex with DNA (Figure 6B).
In the productive orientation two metal-binding sites are formed, which are occupied by Ca2+ ions from the buffer (the identification of the metals is supported by the ligand distances and the X-ray anomalous signal). In both standard electron density maps and anomalous difference Fourier maps, the peak heights for the two metals are very different. The weaker peak (yellow ball in Figure 6C) corresponds to a Ca2+ liganded to an oxygen atom of the scissile phosphate, one Asp50 Oδ atom and three or four water molecules (depending on which molecule in the asymmetric unit is used for the analysis). The stronger peak corresponds to a hexa-coordinated Ca2+ ion with an almost perfect octahedral coordination sphere. The ligands to this metal ion are the other Oδ oxygen atom of Asp50, the Oε atom of Glu55, the main chain carbonyl oxygen atom of Ile56, an oxygen atom of the scissile phosphate, which acts as a bridge between the two metals, and two solvent molecules (Figure 6C).
One of these solvent molecules, which could either be a water molecule or a hydroxide ion (shown with its electron density in Figure 6C) is positioned exactly on the line that links the O3′ oxygen atom of the C−1 residue to the phosphorus atom of the scissile phosphate. This solvent molecule is ideally positioned for an in-line attack on the phosphate atom, which might proceed via a bipyramidal transition state and would correctly predict reaction products with a free 3′-end and a phosphorylated 5′-end.
Although the above catalytic mechanism appears plausible, the reaction does not proceed in the crystals, as evidenced by several detailed features of the crystal structure. (a) There is robust electron density for the potentially scissile phosphorus oxygen bond, suggesting that this bond is predominantly not cleaved. (b) At least at the present resolution, there is no significant deformation of the tetrahedral geometry at the phosphorus atom towards a trigonal bipyramidal arrangement. (c) The putative ‘catalytic’ water molecule is 3.3Å away from the phosphorus atom. This distance is not significantly smaller than the sum of the van der Waals radii, if coordinate errors of the crystal structure are taken into account, but it is significantly larger than a typical phosphorus oxygen bond in DNA (~1.6Å). Therefore, it seems that the reaction has either not started in the crystal structure, or alternatively that it is trapped very early in the trajectory, which is consistent with the biochemical observation that Ca2+ ions in the active site support DNA binding, but not DNA cleavage. Why is the reaction blocked with Ca2+ ions in the active site and would proceed with Mg2+ ions in the active site? Typical oxygen ligand distances are much shorter for Mg2+ than for Ca2+ (2.1Å versus 2.4Å) (33), which might lead to a slight mispositioning of the nucleophilic water molecule in the complex with Ca2+. However, more sophisticated simulation studies attribute the different efficiencies of the Mg2+ and Ca2+ forms of REases to kinetic effects and not to the properties of the prereactive states of these enzymes (34,35).
Nearly all restriction enzymes that recognize palindromic or pseudopalindromic sequences form functional dimers or higher order assemblies of dimers that match the 2-fold symmetry or pseudosymmetry of their target sequence (35). Nevertheless, our suggestion that MvaI interacts with its (pseudo)palindromic target sequence as a monomer already has precedence among REases. MspI and HinP1I, which are specific for the palindromic sequences C/CGG and G/CGC, respectively, have both been reported to be monomers in solution (36–38).
In the case of MspI, the authors of the crystal structure have proposed several possible explanations for how a monomeric restriction enzyme can generate double-strand breaks in DNA. (a) Although monomers are present in the absence of DNA, there might be a monomer–dimer equilibrium in the presence of DNA. Therefore, an initial monomer–DNA complex might recruit another monomer for the cleavage reaction. (b) Alternatively, MspI might achieve double-strand DNA cleavage by bringing together two monomers bound on two separate recognition sites. (c) MspI might cut strands sequentially, possibly in a concerted manner by a flip of the enzyme around the DNA after the first cleavage reaction (36).
In the case of HinP1I, a conserved dimer is present in crystals of the apo-form and of the DNA complex, despite the predominantly monomeric nature of the enzyme in solution (37). This unusual back-to-back dimer can bind two copies of DNA, which are recognized asymmetrically and are in contact with one active site (38). As expected, only one strand was cleaved if these crystals were soaked with Mg2+ ions. In solution, HinP1I treatment of circular DNA with multiple recognition sites leads first to a nicked product, but linear DNA appears earlier than one might expect from a completely random nicking reaction, suggesting either that the likelihood of strand hydrolysis at any HinP1I site increases when the other strand is cleaved or that the enzyme remains in the vicinity of its site after hydrolysis and can rebind (38).
The results reported here show that MvaI is a monomer that recognizes its target DNA asymmetrically. The data explain the cleavage rate differences for the two DNA strands (8,9) and are consistent with a possible sequential cleavage model. As MvaI forms a ‘clamp’ around the DNA, rotation of MvaI around the long axis of DNA seems plausible (Figure 3B). However, such a rotation alone can never lead to a second productive binding event, simply because the two DNA strands run in opposite direction. To enable cleavage of the second strand by the same active site, a rotation around the 2-fold axis of pseudosymmetry perpendicular to the DNA is required to position the enzyme in a second catalytically productive orientation. The crystallographically observed hinge mobility might be important to straighten out the protein clamp, so that this rotation can proceed without clashes with the DNA. However, alternative models to explain the endonuclease activity of MvaI such as hairpin formation (39,40) cannot yet be ruled out.
Our biochemical and crystallographic results indicate that MvaI is monomeric in the presence and absence of DNA and conflict with a prior report that MvaI dimerizes in the presence of DNA (25). The crystal structure of the MvaI-DNA complex shows that two monomers of MvaI cannot simultaneously interact specifically with the target DNA, because they would compete for interactions with the same bases. Moreover, two MvaI monomers which would be related by the (pseudo) 2-fold symmetry of the target sequence would also clash in various other places (not shown). Nonetheless, if MvaI dimerization on the DNA is indeed relevant in some conditions, then hinge mobility of MvaI could explain how one MvaI monomer could recruit another monomer in a second productive orientation without clashes or competition for target bases.
To our knowledge, MvaI is the first crystallographically studied REase that generates products with single nucleotide 5′-overhangs. The overhangs are 1nt shorter than the 2nt overhangs generated by MspI and HinP1I. This emphasizes the versatility of monomeric REases with respect to the cleavage pattern. In principle, such monomeric REases could generate overhangs of any length, simply by asymmetrically binding a palindromic or pseudopalindromic recognition sequence with the center of symmetry or pseudosymmetry placed in the appropriate distance from the scissile phosphate bond. Note that although DNA recognition is asymmetric, the target sequence has to be symmetric or pseudosymmetric, simply because for double-strand breaks to occur, the DNA must be recognized in both orientations by the enzyme. The requirement for symmetry or pseudosymmetry of the target might be relaxed in cases with degenerate sequence recognition.
MvaI recognizes the same sequence as EcoRII, but generates single nucleotide 5′-protruding ends while EcoRII and PspGI produce five nucleotide 5′-overhangs. A comparison of the MvaI and EcoRII (41) protomers shows that these two enzymes are barely related [the DALI score 0.9 for the pairwise superposition is insignificant (42)] The similarity is restricted to the catalytic core, which is essentially conserved in all restriction enzymes. Moreover, PspGI and EcoRII are dimers and show similarities to the Ecl18kI enzyme specific for the/CCNGG sequence (5) rather than MvaI. This example indicates once more that cleavage stagger and not only the target sequence should be considered as the ‘phenotype’ of a restriction enzyme for the statement ‘phenotype predicts genotype’ (3) to hold true.
Weak sequence similarity at the PSI-BLAST level, which has been noted before (23), suggests that MvaI is related to MutH, a component of the DNA-repair machinery in bacteria, which acts as a nickase on hemi-methylated DNA (43). Despite the functional differences, there are many parallels between MvaI and MutH. Both proteins are two-lobed monomers and recognize almost symmetric target sequences asymmetrically. A comparison of the X-ray structures confirms the similarities, particularly in the vicinity of the active sites, but also reveals substantial differences in the recognition domains, which is not surprising because the two enzymes recognize unrelated target sequences (CC/WGG for MvaI versus/GATC for MutH). At the sequence level, MvaI is strikingly similar to BcnI, which recognizes the related sequence CC/SGG (S stands for C or G) and cleaves it like MvaI. In accompanying manuscripts, we present the crystal structures of BcnI with and without target DNA and compare them in detail with the corresponding structures for MvaI and MutH.
Supplementary Data are available at NAR Online.
We thank Prof. Hans Bartunik for generous allocation of beamtime on BW6 (DESY, Hamburg) and Dr Gleb Bourenkov and Dr Galina Kachalova for assistance with data collection. We are grateful to AB ‘Fermentas’ for the MvaI RM system clone and NEB for the PspGI methylase clone. Research in the V.S. laboratory was supported by the Howard Hughes Medical Institute (HHMI) International Research Scholar grant # 55000336. M.B. thanks the European Molecular Biology Organization (EMBO) and HHMI for the Young Investigator award. H.C. is grateful to the Polish Ministry of Science and Higher Education for a POL-POSTDOC grant (PBZ/MEiN/01/2006/24). This work was supported by the European Commission 5th Framework Programme project ‘Center of Excellence in Molecular Bio-Medicine Contract no: QLK6-CT-2002-90363’ (Warsaw) and ‘Center of Exellence – Biocell’ Contract no: QLK2-CT-2002-30575 (Vilnius). The funds from the Polish Ministry of Science and Information Technology for the purchase of a crystallization robot (Decision Nr. 5210/IA/1789/2005) are gratefully acknowledged. The apo-MvaI and the MvaI–DNA structures will be available upon acceptance of the article from the Protein Data Bank (http://www.rcsb.org) under accession codes 2OA9 and 2OAA, respectively. Funding to pay the Open Access publication charge was provided by UNESCO/Polish Academy of Sciences Cellular and Molecular Biology Network.
Conflict of interest statement. None declared.