|Home | About | Journals | Submit | Contact Us | Français|
To obtain structural information on the early stages of V(D)J recombination, we isolated a complex of the core RAG1 and RAG2 proteins with DNA containing a pair of cleaved recombination signal sequences (RSS). Stoichiometric and molecular mass analysis established that this signal end complex (SEC) contains two protomers each of RAG1 and RAG2. Visualization of the SEC by negative staining electron microscopy revealed an anchor-shaped particle with approximate two-fold symmetry. Consistent with a parallel arrangement of DNA and protein subunits, the N-termini of RAG1 and RAG2 are positioned at opposing ends of the complex, and the DNA chains beyond the RSS nonamer emerge from the same face of the complex, near to the RAG1 N-termini. These first images of the V(D)J recombinase in its post-cleavage state provide a framework for modeling RAG domains and their interactions with DNA.
V(D)J recombination is central to establishing a functional adaptive immune system. The large repertoire of immunoglobulins and T cell receptors is generated in many vertebrates by combinatorial rearrangement of an extensive array of variable (V), diversity (D), and joining (J) gene segments that are joined to encode the variable domains of the protein chains (Gellert, 2002).
The recombination signal sequences (RSS) that flank these gene segments are recognized, paired in a synaptic complex, and cleaved by collaboration of the lymphoid-specific proteins RAG1 and RAG2. Full-length mouse RAG1 contains 1040 aa and RAG2 527 aa. However, only the cores of RAG1 (384–1008 aa) and RAG2 (1–387 aa) are necessary for the basic recombination reaction (Cuomo and Oettinger, 1994; Sadofsky et al., 1994; Sadofsky et al., 1993; Silver et al., 1993) and for in vitro RSS binding and cleavage (McBlane et al, 1995). Most biochemical studies on the RAG proteins, including the work presented in this paper, have been done with these core proteins, because the full-length proteins are difficult to express and purify. The deleted portions of both proteins probably have regulatory functions (Jiang et al., 2005; Swanson et al., 2004). The RAG1 core contains the DDE triad that is found in many ‘cut and paste’ DNA transposases (Fugmann et al., 2000; Kim et al., 1999; Landree et al., 1999). The RAG2 core has been predicted to be a six-bladed beta propeller (Aravind and Koonin, 1999; Callebaut and Mornon, 1998). In the presence of Mg2+, the RAG1/2 complex catalyzes a double-strand break at the coding/signal junctions, leaving blunt cut signal ends (Schlissel et al., 1993) and DNA hairpins on the ends of the flanking coding sequences (McBlane et al., 1995; van Gent et al., 1997). DNA cleavage is a two-step reaction: hydrolysis of one DNA strand creates a nicked intermediate whose 3’ hydroxyl group goes on to attack the phosphodiester bond of the opposing DNA strand, resulting in a hairpin. Consistent with the two-metal-ion mechanism described for DDE phosphotransferases, Ca2+ supports binding of the RSS at the RAG1/2 active site but inhibits both cleavage steps (Hiom and Gellert, 1997; Nowotny and Yang, 2006).
There are two types of RSS, 12RSS and 23RSS, differing in the length of the spacer (12 or 23bp) between a conserved heptamer (consensus CACAGTG) and a nonamer, (preferably ACAAAAACC) (Figure S1). In all cases the heptamer sequence borders the coding V, D, or J segment. Efficient V(D)J recombination requires one RSS of each type; this is known as the 12/23 rule (Sakano et al., 1980). The preference is already evident when RAG1/2 binds to the RSSs (Hiom and Gellert, 1998), though there may be additional selectivity at the cleavage stage (West and Lieber, 1998).
In vivo, the two hairpinned coding ends are rapidly transferred to the Non-Homologous End Joining (NHEJ) DNA repair factors for joining (Lieber et al., 2006). Signal ends persist longer in vivo before being joined in a process requiring most of the same NHEJ factors. In vitro, the cleaved signal ends remain tightly bound to the RAG proteins in a particularly stable Signal-End Complex (SEC) that is resistant to high salt or heparin (Agrawal and Schatz, 1997; Jones and Gellert, 2001). The relationship between RAG1/2 and DDE transposases is also evident in the SEC since this complex is capable of transposing the signal ends into target DNA (Agrawal et al., 1998; Hiom et al., 1998; Jiang et al., 2004; Reddy et al., 2006).
The determination of the stoichiometry of RAG1 and RAG2 proteins bound to RSSs has previously been based largely on electrophoretic mobility shift assays. Results have varied, with two groups concluding that the pre-cleavage complex contains a RAG1 dimer and two RAG2 monomers, together with a 12RSS, a 23RSS and an uncertain amount of HMGB1 (Bailin et al., 1999; Swanson, 2002), while another study reported an excess of RAG1 over RAG2 (Mundy et al., 2002). The spatial conformation of the RAG1/2-DNA complex has also been obscure. In one study addressing this question by in-gel FRET measurements, distances between fluorescent probes placed at either end of 12RSS and 23RSS substrates were calculated (Ciubotaru et al., 2007). The results indicated that the DNA bends and the two RSSs cross over, but no orientation of the protein domains could be ascertained. RSS bending on binding to RAG1/2 has also been reported from circular permutation assays (Aidinis et al., 1999) and atomic force microscopy (Pavlicek et al., 2008). HMGB1, which is known to severely bend a DNA duplex, is required for efficient V(D)J cleavage in vitro and is postulated to stabilize a bent 23RSS spacer and facilitate 12/23 paired complex formation for DNA cleavage (Kim and Oettinger, 1998; van Gent et al., 1997; West and Lieber, 1998).
For quantitative biochemical characterization and to obtain three-dimensional structural information about RAG1/2 bound to RSS DNA, we generated and purified the stable post-cleavage SEC. We determined the precise stoichiometry of RAG1, RAG2, HMGB1, 12 and 23RSS in the SEC and analyzed the structure of the SEC by electron microscopy and atomic-force microscopy. By fusing the RAG1 and RAG2 cores with maltose binding protein (MBP), combined with antibody tagging, we have defined the arrangement of RAG1 and RAG2 in the complex, as well as the exit points of DNA. A model for the organization of the protein domains within the recombinase complex is proposed.
The increased stability of the RAG1/2-DNA complex after DNA cleavage was exploited to obtain sufficient quantities for characterization and structural analysis. To purify the post-cleavage complex, DNA with a 12RSS was bound to streptavidin-agarose via a biotin label on the coding flank. Catalytically active core RAG1 and RAG2 proteins, HMGB1 and 23RSS were then added step-wise, as described in Experimental Procedures, using Ca2+ to aid complex formation without DNA cleavage (Figure 1A). Unbound protein and DNA were removed with wash steps. After cleavage in Mg2+, the signal end complex (SEC) was released from the column matrix and collected. The eluate could be concentrated by ultrafiltration and contained both RAG proteins (Figure 1B). The concentrated eluate formed a major band on a native polyacrylamide gel (Figure 1C). In addition to the proteins, this band contained cleaved 12 and 23RSS in a 1:1 ratio (Figure 1D).
We tested radio-labeled SEC eluted from the streptavidin column for its ability to transpose the signal ends into plasmid DNA in vitro (Figure 1E). Transposition was more efficient with the purified SEC than starting from cleavage reactions with either the same or 10-fold greater RAG concentrations, indicating that the SEC was enriched for active RAG1/RAG2 species.
The N-terminal MBP tag on RAG1 or RAG2 protein could be removed in alternative constructs that contained a site for cleavage by the PreScission protease (Figure S2). Digestion following SEC purification resulted in a stable complex with faster migration (Figure 2A). Complexes that had MBP removed from either RAG1 or RAG2 had similar mobilities in the native gel, suggesting that the same number of MBP moieties had been removed, and therefore an equal number of RAG1 and RAG2 units were present in each SEC. When the lane from the native gel was excised and run in a second dimension on a denaturing gel, RAG1, RAG2 and HMGB1 were identified to be protein components of both digested and non-digested SEC bands (Figure 2B, i–iii). 2D electrophoresis also verified complete digestion of the alternative MBP fusions. Further purification of the SEC away from aggregates and contaminating protein and DNA was achieved using gel filtration (Figure 2C), resulting in a main peak containing RAG1, RAG2 and HMGB1 (Figure 2D).
Standards of known RAG1:RAG2:HMGB1 ratios were run to assess the stoichiometry of the protein components of the SEC (Figure 2D). Fluorescence of the Coomassie G-250 stained gel was used to quantify each protein, and the relative amounts of RAG2 and HMGB1 to RAG1 were compared to the standards (Figure 2D). In accordance with the digested complexes in the native gel, we found that there was a 1:1 ratio of RAG1 to RAG2. Slightly less than one HMGB1 was present per RAG1 dimer (~0.5/RAG1, Figure 2E). A small fraction of the SEC may lose HMGB1 during purification if it is not tightly bound. The empirical stoichiometry of the protein components of the SEC was estimated to be 2 RAG1: 2 RAG2: 1 HMGB1.
The molecular mass of the SEC was measured by multi-angle static light-scattering (MALS) analysis of the complex eluting from an in-line gel-filtration column (Figure 3A). The peak corresponding to the SEC, identified by native gel electrophoresis (Figure 3B), contained a species of average molecular mass 505 kDa (+/− 19 kDa) as averaged from 4 measurements. The measured value matches well with the calculated molecular mass (504 kDa) of an SEC comprising a 12 and 23RSS pair of DNAs (49 kDa), a single unit of HMGB1 (19 kDa), and two RAG1/2 (MR1/LR2) heterodimers (218 kDa). Given the equimolar ratio of RAG1 and RAG2, a SEC would be 722 kDa or 960 kDa if 3 or 4 RAG1/2 heterodimers were present, far outside the experimentally measured range. The A260/A280 ratio (1.27) of the SEC from gel filtration substantiated the presence of the two expected DNA fragments with the RAG1/2 heterotetramer (Table S1).
Further support for the molecular mass value was obtained by scanning transmission electron microscopy (STEM) of samples eluted from in-column complex formation (Figure 3C). The mass of the SEC as measured by STEM (calibrated against the mass per unit length of tobacco mosaic virus) presents a single peak centered at 502 ± 64 kDa (S.D.) (Figure 3D). The precision of the STEM data is insufficient to determine the amount of HMGB1 in each SEC but sufficient to corroborate that the SEC contains two RAG1–RAG2 heterodimers bound to the pair of RSS DNAs.
The purified SEC assembled from MBP-fused core RAG1 and RAG2 was adsorbed onto carbon-coated grids, negatively stained with uranyl acetate, and observed by transmission electron microscopy (EM). The micrographs showed an evenly spread population of particles of ~150 Å in the largest dimension (Figure 4A). Many of them exhibited a branched X or Y shape, while others appeared rounder. To enhance resolution, we selected 4200 SEC particles from an initially digitized set of > 10,000, and used reference-free alignment and classification methods to assign them to different internally consistent classes; the images in each class were then averaged. This procedure resulted in 8 predominant class-averages, each with distinctive features (Figure 4B). We experimented with larger numbers of classes but found this change in procedure resulted only in splitting of major classes into two or more closely related subclasses.
The most prominent finding to emerge from the class-averages is detection of a two-fold rotational axis (C2) in the complex. This axis is either in the image plane in the projections of classes i–iv and viii, or perpendicular to the image plane (class v). This symmetry was also approximately observed in classes vi and vii, which likely represent projections between the two orthogonal views of SEC particles. Although the 11 bp difference between the 12 and 23RSS should in principle render the SEC particle asymmetric, such a difference is apparently too small to be discernible, given the limited resolution of the negatively stained EM data. Thus the SEC appears to be pseudo-symmetric.
In all 8 class-average images, a well-defined core structure is seen surrounded by several additional, mostly fainter, densities. These peripheral densities are labeled as “arm” and “tail” in Figure 4B (i) and appear as protuberances on opposite sides of the rounder core shape of class v. Further sub-classification did not result in better definition of the peripheral densities, indicating that their relative faintness might result in part from local mobility. This inference was supported by variance maps calculated from the particles in each class separately, or from combining classes i to iv, which confirmed that the arm and tail regions are more variable and thus more mobile than the core (Figure S3). Interestingly, classes i and iii exhibited a better-defined density on one of the “arms”, which did not obey the C2 symmetry (Figure 4B). While it is possible that this asymmetry may results from “locking on” to the density on one side during classification and smearing out the density on other side, we think it more likely that the distinction may arise from the ‘arm’ on one side being more fully immersed in the stain layer and consequently better contrasted than its counterpart on the other side (Figure S4). The dimensions of this feature were ~ 45 × 60 Å, roughly matching the expected 2D projection of the 40 kDa MBP. MBPs are linked to the N-termini of RAG1 and RAG2 by a 25 aa flexible tether and are expected to be mobile. The peripheral location of the MBPs is also consistent with the observation that MBP fusion does not alter the recombination activity of RAG1 and RAG2 (Kriatchko et al., 2006; Mundy et al., 2002).
To better define the locations of MBPs in the SEC, we performed immuno-labeling with an anti-MBP Fab. Purified SEC particles were incubated with the Fab, negatively stained and visualized by EM. In agreement with the measured stoichiometry of two RAG1 and two RAG2 molecules in each SEC, four anti-MBP Fab extensions were observed for each SEC particle (Figure 4C). The extensions are near the “arms” and “tail” of the SEC, and their size approximately matches the combined dimensions of Fab (75 × 55 × 40 Å) and MBP (60 × 50 × 40 Å) (Figure 4C). Due to the flexibility of the MBPs’ linkers, alignment and class averaging of the Fab-bound SEC particles was not productive. However, the orientations of the Fabs are indicative of the configuration of the SEC. Two of the Fab-MBP extensions appear to protrude from adjacent points on the SEC particle and are nearly parallel, and the other two diverge from the opposite end of the SEC. Superimposed onto the unlabelled class-averages, it appears likely that the two parallel Fab extensions belong to the “tail” MBPs and the other two to the “arm” MBPs (Figure 4C). Assuming that the pseudo-C2 symmetry of SEC applies to RAG1/2 and MBP fusions, we surmised that one pair of MBPs is associated with RAG1 and the other pair to RAG2.
To distinguish the pairs of MBPs and their connections to RAG1 and RAG2, we used the AR1 and MR2 variations and removed the MBPs fused to RAG1 by protease digestion after assembly and purification of SEC particles (Figure S2). The remaining MBPs on RAG2 were labeled with the Fab and viewed by EM. The resulting images showed two extended densities for each SEC particle (Figure S5). Alignment and averaging of 67 similar particles showed that the core resembled the ‘round’ projection of SEC (class v) and the MBP-Fabs extended outwards on opposite sides (Figure 4D). Therefore MBPs fused to the RAG2s are most probably assigned as the “arms” in Figure 4B (i–iv), and the “tails” of the anchor-shaped SEC are most likely the MBPs linked to the N-termini of the RAG1 dimer.
Disregarding the peripheral MBPs, the SEC particles in all 8 class-averages show well-defined features. When the projection is normal to the C2 axis, two distinct views of SEC are recorded: (1) an anchor shape with two thick tips and a shank bisected by the vertical C2 axis and further divided into three horizontal sections (Figure 4B, i–iv), and (2) a more compact triangular shape (Figure 4B, viii). Among these five projections, the length of the SEC along the C2 axis consistently measures 125 Å (Figure S6). The width of the shank measures ~80–90 Å. Its three horizontal sections appear most distinct in class ii, where the middle section is separated into two symmetric rounded densities. The widest dimension of the SEC is 150–160 Å between the tips of the anchor-shape projection (class i–iv), falling to 135 Å in the triangular projection. These two projections are probably related by a rotation around the C2 axis, with the triangle representing a side projection of the anchor-shaped SEC (Figure 5).
When the projection was along the C2 axis (Figure 4B, v and vii), the SEC had the appearance of an approximately rectangular figure-of-eight with two diagonally opposed side-flaps. Although there were two possibilities for the SEC being projected along the C2 axis (either end of the anchor towards the carbon grid), the mirror image of the projections v and vii was absent (Figure 4B), indicating a preferential orientation of SEC binding to the carbon grid. The central figure-of-eight is 110 Å long; its width, 80 Å (Figure S6), matches that of the shank in the orthogonal view, and the span of the side flaps (~150 Å) matches the width of the anchor. Accordingly, we suspect that the tips of the anchor drag correspond to the side-flaps of the figure-of-eight, and the shank of the anchor to the figure-of-eight itself.
The central figure-of-eight appear to be made of two C shaped entities around the C2 axis. Each C-shaped entity in the figure-of-eight projection encircles a dark disc of 25 Å in diameter, and the two discs are separated by ~10 Å. These accumulations of stain may indicate a depression, an internal cavity in the particle, or positively stained material such as DNA. Dark areas were also observed in the anchor-shape projections, either forming a line along the C2 axis in Figure 4B (i) or zigzags in Figure 4B (ii). Proteins are usually negatively stained, thus appearing bright in EM. Nucleic acids (DNA or RNA) are likely to be positively stained by uranyl-acetate, and thus may be invisible in these micrographs. In our hands, the 30 bp extensions of RSS DNAs in the SEC particles (thus ~100 Å additions to the core) remained “invisible” in the EM images.
To locate the RSS DNAs, we subjected SEC particles to atomic force microscopy (AFM), a proven technique for visualizing DNA in complex with protein (Shlyakhtenko et al., 2003). To follow the DNA trajectory by AFM, the SEC was assembled using DNA substrates with 500 bp extensions beyond the RSS nonamers and purified as before. Figure 6A shows a population of SEC complexes detected by AFM. The SEC particles were of a uniform size, and the volume of the “headpiece” was consistent with the expected mass of 500 kDa (Figure S7). Depending on how the SEC particles and extended DNAs adsorbed onto the mica surface, a variety of projections was recorded. In a sample of 190 particles, however, it is clear that the two DNA chains predominantly (80% of cases) exit the SEC from adjacent points (Figure 6B, i,ii,iv–vii). More complicated configurations are seen in 10% of particles, where one of the DNA molecules loops and re-binds non-specifically to the protein complex, either at the free DNA end (Figure 6B, iii), or at an intermediate site (Figure 6B, vii). In another 10% the two DNA molecules seem to be exiting on opposite sides of the complex (Figure 6B, iii and viii). If DNAs truly exited from the SEC on opposite sides, they would never, or at least not often, appear to emerge near each other. We suspect that the apparent divergent trajectories of DNA molecules result when they emerge near each other on the SEC but bend and diverge upon attachment to the mica surface. The most populated view by AFM may correlate with the C2 axis in the mica plane and the less populated view to the C2 axis perpendicular to the plane.
In order to identify where on the complex the DNAs exit in relation to RAG1 and RAG2, the MBPs on the SEC were selectively digested and the remaining MBPs were once again labeled with Fab (Figure 6C and D). These experiments also utilized 12 and 23RSS substrates of different lengths (250 bp and 500 bp). Fab-labeled MBPs were clearly identified as globular entities peripheral to the recombinase core. When only the RAG2 MBPs were labeled, the two DNAs most often exited together from the SEC on the opposite side to the Fabs (Figure 6D, i–vi). Consistent with this observation, when only the RAG1 MBPs are labeled, they are situated closer to the exiting DNAs (Figure 6C, i–vi). The MBPs attached to RAG1 appear to be more flexible and assume more varied positions relative to the SEC than the MBPs attached to RAG2.
Extensive purification of the SEC has allowed us to obtain detailed information about its composition and structure. Several lines of evidence show that the SEC contains two protomers each of RAG1 and RAG2, and independent measurements of its molecular mass by gel filtration, light scattering, and STEM all yield a value close to 500,000 Da, implying that the SEC is composed of a RAG1–RAG2 heterotetramer and two cleaved RSS DNAs (Figure 2–Figure 3). There is also one HMG1 protomer per SEC. Earlier studies had shown that HMGB1 stimulates cleavage of a 23RSS much more than a 12RSS, suggesting that HMGB1 is likely to be associated with the 23RSS in the SEC (Aidinis et al., 1999; van Gent et al., 1997).
Our results verifying a (RAG1)2–(RAG2)2 composition of the SEC are not strictly comparable to earlier data, which refer to the complex before DNA cleavage. However, when we examined pre-cleavage synaptic complexes by AFM (Figure S9), the majority of complexes containing the 12 and 23 RSSs were no larger than tetrameric in size (although more heterogeneous than the SEC particles). In addition, there was no separated subunit in the SEC sample eluted after in-column cleavage, and the SEC retains transposition activity, also making it unlikely that there is a change in subunit composition upon cleavage.
A recent paper (Shlyakhtenko et al., 2009) reports an equimolar RAG1–RAG2 hetero-octamer in a pre-cleavage synaptic complex, the size being estimated by AFM. However, this complex, is only detected in the absence of salt and after cross-linking of the proteins with glutaraldehyde, and may represent a less stable species. We note that in our hands, low salt concentrations in the sample buffer used for AFM increased aggregation and non-specific RAG1/2-DNA interactions.
We analyzed the SEC by negative staining EM. Classification of these data led us to identify three predominant projections, each with an apparent twofold axis. Two of these projections are perpendicular to this axis and respectively have an “anchor” shape and a “triangular” shape; the other is along the twofold axis and shows a “figure-of-eight” with two symmetrically disposed side-flaps (Figure 5). These three views of the same object are analyzed further below.
Because the core RAG1 and RAG2 constructs used for preparing the SEC were fusions with MBP, it was possible to tag the SEC with monovalent anti-MBP Fab fragments, and thus localize the MBP moieties. It appears that in the “anchor” projection, all four MBPs lie in approximately the same plane. Furthermore, by selectively removing MBP from RAG1 and again tagging with the anti-MBP Fab, we were able to distinguish the two RAG proteins in the SEC. These data indicate that RAG2 is at the “drag” end (the wide end), of the anchor and their MBPs form the “arms” and the MBP attached to RAG1 at the “shank” end and form the “tails” (Figure 4),
Further information obtained by AFM was used to determine the orientation of RSS DNAs in the SEC. Because the cleaved ends are presumably concealed within the complex, one can only detect the DNA extending beyond the nonamer of the RSS. In the AFM images, the pair of ends predominantly emerges from the same side of the SEC; indicating that the DNA fragments are arrayed in generally parallel rather than antiparallel (or oblique) orientation. This result dovetails with data showing that with a short spacing between two RSSs deletional recombination is more efficient than inversional joining (Lewis and Hesse, 1991; Sheehan and Lieber, 1993), presumably because deletion requires a shorter path and fewer turns of DNA than inversion. (Figure S8) (Kim and Oettinger, 1998). Additional AFM images of Fab-tagged complexes made with either MBP-RAG1 or MBP-RAG2 revealed that the exit points of the DNA lie close to the RAG1 end and opposite the RAG2 end.
Recently an antiparallel configuration of two nonamer DNAs crystallized with the “nonamer binding domain” (NBD) of RAG1 was reported (Yin et al., 2009). The contrast to the parallel configuration as shown in the EM and AFM analyses of SEC may be in part caused by the different context of these observations. The NBD fragment (aa 389–464) has no detectable binding to the nonamer in solution and displays few sequence-specific contacts in the crystal structure (Yin et al., 2009).
Having located the RAG1 and RAG2 N-termini and nonamer exits on the SEC particle, we were able to generate a fairly detailed model of the RAG1-RAG2-DNA complex (Figure 7). Like the bipartite RSS, the primary sequence of the core RAG1 can be considered in two parts: an N-terminal nonamer-binding domain (NBD) and a larger catalytic domain that interacts with the heptamer and coding sequence (Figure S2). The NBD is placed near the shank end based on the MBP mapping and the DNA extension beyond the nonamer (Figure 4C–D and Figure 6). Biochemical data showed that both RAG1 and RAG2 contact the RSS heptamer (Eastman et al., 1999); this would place the catalytic domain of RAG1 near the drag end of the anchor. In other words, the two units of the RAG1 dimer would extend from the shank end roughly side by side and then cross over (Figure 5B), each with a RAG2 protomer attached to form the drag end. Similarly, the two RSSs extend through the length of the “anchor”. To accommodate the extra 11 bp of the 23RSS, additional bending possibly facilitated by HMGB1 will be necessary.
To carry the modeling further, we made extensive use of sequence homologies of RAG1/2 with proteins of known structures. We used the Tn5 transposase post-DNA cleavage complex (PDB: 1MUR) as a model for the dimeric RAG1 catalytic domains associated with the heptamers. However, the DDE triad (D600, D708 and E962) essential for catalysis in RAG1 is split by a predicted C2H2 zinc finger and a helix-rich region, together including residues ~720–950. This inserted domain has been shown to be important for hairpin formation and heptamer/coding sequence contacts (Huye et al., 2002; Lu et al., 2006). The eukaryotic Hermes transposase also contain a DDE triad interrupted by a helical domain (Hickman et al., 2005). Superimposition of the Hermes transposase structure onto that of Tn5 (guided by the DDE motifs) does not alter the overall shape of the protein-DNA complex. The transposase dimer and two DNA ends (Davies et al., 2000) can be fit into the EM 2D projections as the “drag” end of SEC. The fitting is particularly pleasing in the “figure-eight” projection (Figure 5A), which gives evidence of “channels”, which were also seen in the EM structure of HIV integrase attributed to its catalytic domain (Michel et al., 2009). The Tn5-DNA complex has an antiparallel orientation of transposon ends at the two active sites. We envisage that a bend in 12 or 23bp spacer, possibly accompanied by further bending in the heptamer itself (Olson et al., 1998; Patel et al., 1987), allows the heptamers to adopt a similar head-to-head configuration in SEC.
Residues 390–460 of RAG1-NBD share sequence homology with the Helix-Turn-Helix (HTH) DNA-binding domain found in the HIN recombinase (Difilippantonio et al., 1996). A parallel arrangement of RSS DNAs associated with HTH domains is predicted for the Tc1/mariner family of transposases (Richardson et al., 2006). To model the NBD of RAG1 and its associated nonamer DNA, a pair of HTH domains and associated DNAs was adopted from the transcription factor POU-HMG1-DNA complex (PDB: 1GT0). Our interpretation is that the NBD (residue 390 – 590) may consist of more than one HTH domain. We adjusted the locations of the HTH domains along the DNA major groove and applied the C2 symmetry to the HTH-DNA complex to match the “b” and “c” densities (Figure 5B). The resulting placements were further adjusted to connect the nonamer to heptamer by 12 or 23 bp spacer DNA.
The essential core of RAG2 (residues 1–352) has been predicted to fold into a six-bladed beta propeller similar to those of the C-terminus of DNA gyrase A protein (Corbett et al., 2004). The GyrA beta propeller (60 Å in diameter and 35 Å deep) was used as a model, and fits reasonably well with the side flaps in the figure-eight projection and also with the tips of the anchor shape. The position of RAG2 would presumably allow it both to interact directly with RAG1 and to influence the heptamer binding to RAG1 (Aidinis et al., 2000; Eastman et al., 1999; Swanson and Desiderio, 1998, 1999).
After multiple rounds of building and rebuilding, we generated a SEC model whose dimensions and calculated two-dimensional projections approximately match the EM class-average images. Furthermore in this model, the DNA configuration is compatible with the trans mode of DNA binding by the RAG1 dimer (Swanson, 2001) where the nonamer binding domain and active site are provided by different RAG1 subunits.
In summary, by combining electron microscopy with information from mutational studies and homologous proteins, we have built a space-filling model of the SEC that is consistent with existing biochemical data, and helps to clarify the relation between the RAG proteins and transposases. Our results will serve as a platform for further experimental work on the structure and biological functions of the RAG1/2 complex and on its interplay with factors involved in DNA repair and in the regulation of V(D)J recombination.
Core RAG1 (384–1008) and RAG2 (1–387) fused to MBP (MR1 and MR2) were made as described (McBlane et al., 1995). Alternative constructs of RAG1 (AR1) and RAG2 (LR2) contained a PreScission protease site between the MBP and core RAG sequence (Figure S2). MR2’ (with no c-myc tags) and LR2 coding regions were inserted into pLEXm vector (Aricescu et al., 2006) and transiently expressed in HEK293T cells (Swanson et al., 2004). Preparation of HMGB1 (1–163) and MBP Fab fragments is described in Supplemental Data. 23RSS and biotinylated-12RSS substrates were made from annealed oligonucleotides. DNA substrates for the AFM samples were prepared from pDVG54 plasmid (Supplemental Data, Figure S1).
500 pmol biotinylated 12RSS was bound to 0.2 ml streptavidin agarose (Novagen) in a 10 ml column. RAG-DNA complexes were assembled in the column in 10 ml binding buffer (25 mM MOPS-KOH pH 7, 50 mM potassium glutamate, 40 mM KCl, 4 mM CaCl2, 1% (v/v) glycerol, 4 mM dithiothreitol, 0.01% (w/v) dodecyl-D-maltoside, 0.4 µM AEBSF, 5 µM leupeptin, 0.5 µM pepstatin). 10 µg/ml HMGB1 then 200 µg/ml MR1/MR2 or AR1/MR2, or 100 µg/ml MR1/LR2, were added to the column at 37 °C for 30 minutes. 750 pmol 23RSS was included and incubated for another 30 minutes. The resin was washed with 20 ml of binding buffer. 5 mM MgCl2 in 0.5 ml binding buffer was added and incubated for 1 hour at 37 °C. Eluted SEC and a 0.5 ml wash were collected and concentrated in 100 mM potassium glutamate by ultrafiltration using a 100 kDa molecular weight cut-off membrane (Millipore). MBP was digested from AR1 or LR2 following SEC purification (minus protease inhibitors) with 20 µg/ml PreScission protease overnight at 4 °C. Purifications of radio-labeled SEC, analysis of SEC by gel electrophoresis, and transposition assays are detailed in Supplemental Data.
The methods are described in detail in Supplemental Data. Briefly, refractive index and multi-angle light scattering of samples eluted from a gel filtration column (TSK-G3000PWXL, Tosoh) were measured in-line using Optilab rEX and DAWN HELEOS instruments (Wyatt). Molar mass was obtained using Astra V software (Wyatt, 1993). STEM micrographs were recorded at the Brookhaven STEM facility (www.biology.bnl.gov/stem/stem.html). Intensities of SEC and TMV were measured using PCMass29 (Hainfeld et al., 1982), applying 115 Da/intensity unit. A scaling factor calculated from the measured (13.3 +/− 0.8 kDa/Å, Figure S10) and known (13.14 kDa/Å) TMV mass per length values was applied to correct the SEC mass.
AFM volume measurements were calculated from pixels values in ImageJ software (Figure S7).
10–20 µg/ml of an SEC sample was applied to a glow-discharged carbon-coated copper grid, washed with five drops of deionized water and stained with two drops of 1% uranyl acetate. Micrographs of a final pixel size of 3.174 Å were produced, as described in Supplemental Data.
4,200 SEC images were selected from 87 micrographs for good particle distribution, staining, and optical quality (appropriate defocus, lack of drift, etc). SPIDER (Frank et al., 1996) was used to center and sort the particles by a reference-free alignment. Iterative cycles of correspondence analysis, hierarchical clustering by the Ward criteria, and multireference alignment produced 8 classes, each containing at least 420 particles. The resolution of the 2D averages, estimated by Fourier ring correlation at 0.5 cut-off, varied between 26 and 29 Å (Saxton and Baumeister, 1982). EMAN (Ludtke et al., 1999) and BSOFT (Heymann and Belnap, 2007) were used at multiple steps for image visualization and processing.
We thank Drs. J. S. Wall and M. M. Simon of Brookhaven National Laboratory for STEM data acquisition. S.R.-M. is indebted to Drs. N. Cheng, C. S. Smith, U. Baxa and D. Winkler for help with EM data acquisition, to Drs. G. Effantin, N. Mizuno and U. Baxa for guidance in image analysis, and to Drs. M. Nowotny and J. Y. Lee for insightful discussions. We thank C. Vander Kooi, P. Longo, J. McLellan and D. J. Leahy for help in establishing the mammalian expression system.
S.R.-M. was a recipient of a Human Frontiers Science Program fellowship. This work was supported by the intramural research programs of NIDDK, NIBIB and NIAMS of the National Institutes of Health.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.