V(D)J recombination is the vertebrate gene rearrangement process for assembling the antigen receptor genes, immunoglobulins and T cell receptors (
Gellert, 1992;
Jung et al., 2006). The V, D and J coding segments must be assembled to form the variable domain exon, which encodes the binding pocket for antigens. This process is essential for acquired immunity, and without it, humans and other mammals have severe combined immune deficiency (SCID) (
Revy et al., 2005;
Schwarz et al., 1996).
The RAG1 and 2 genes evolved from transposons that can be found among invertebrates (
Chatterji et al., 2004;
Kapitonov and Jurka, 2005). The lymphoid-specific RAG1, RAG2, and the ubiquitous HMGB1 protein form a complex that binds to heptamer/nonamer recombination signal sequences (RSSs) that can have either a 12-or a 23-bp spacer between the heptamer and nonamer, and are therefore referred to as a 12-RSS and a 23-RSS (
Gellert, 2002;
Lieber, 2007;
Schatz, 2004;
Swanson, 2004). Each V, D, or J segment has a 12-or 23-RSS adjacent to it. One recombination reaction involves one 12-RSS and one 23-RSS, a requirement referred to as the 12/23-rule. The RAG complex nicks at the 5' edge of the RSS, and the RAG complex uses the 3'OH of the V, D, or J coding end to carry out a nucleophilic attack on the anti-parallel strand to create a perfect hairpin at the coding end, thereby resulting in a blunt signal end that has 5'P and 3'OH (
Roth, 1993;
Schlissel, 1993). The coding end formerly attached to the 12-RSS can be called the 12-coding end, and the 23-coding end is named correspondingly.
The opening of the coding end hairpins requires Artemis and DNA-PKcs (
Ma et al., 2002). DNA-PKcs is a serine/threonine protein kinase that must bind to DNA termini in order to become active as a kinase (
Anderson and Carter, 1996). Artemis and DNA-PKcs form a complex within the cell (
Ma et al., 2002). When DNA-PKcs encounters a DNA terminus, it becomes active as a kinase, autophosphorylates itself, and thereby alters the conformation of Artemis in a manner that permits Artemis to function as an endonuclease (
Ma et al., 2002). The endonucleolytic properties of Artemis permit it to nick hairpins, preferably 2 nt 3' of the tip, and to nick 5' overhangs and 3' overhangs (
Ma et al., 2002).
The activation of DNA-PKcs has been a point of particular uncertainty. It has been shown that the signal ends are bound so tightly by the RAG complex after coding end hairpin formation, the two signal ends cannot be ligated unless the DNA is deproteinized (
Jones and Gellert, 2001). The 12-RSS is only 28 bp long, and DNA-PKcs plus Ku require 27 bp of naked DNA for DNA-PKcs to be stimulated (
West et al., 1998). Hence, it is difficult to invoke the two signal ends as being sufficiently exposed to activate DNA-PKcs. Each coding end is in a hairpin conformation, and very nice work has demonstrated that hairpins do not activate DNA-PKcs for phosphorylation of p53 peptide targets (
Smider et al., 1998), though the results for stimulation of autophosphorylation have been more complex (
Soubeyrand et al., 2001). Hence, it has been unclear if either the signal ends or the coding ends are capable of activating DNA-PKcs. If neither can activate DNA-PKcs, then how is DNA-PKcs activated?
Once the hairpins are opened, polymerase mu or polymerase lambda can fill-in gaps or 5' overhangs in a template-dependent manner (
Ma et al., 2004;
NickMcElhinny and Ramsden, 2003). TdT, and to a lesser extent pol mu, can add nucleotides in a template-independent manner (
Gu et al., 2007a). Artemis:DNA-PKcs can continue to resect flaps or to nick at gaps (
Ma et al., 2005b). XLF:XRCC4:DNA ligase IV can carry out ligation, with XLF (also called Cernunnos) stimulating incompatible DNA end ligation (
Gu et al., 2007b;
Tsai et al., 2007).
The joining phase of V(D)J recombination illustrates several aspects of the flexibility of the enzymes involved. Pol mu and pol lambda can slip on the template or mis-incorporate nucleotides more frequently than high-fidelity polymerases (
Ramadan et al., 2004). Pol mu can add in a template-independent manner (
Gu et al., 2007a;
Ramadan et al., 2004). The ligase complex can ligate the top strand independently from the bottom strand, can ligate incompatible ends, and can ligate across gaps (
Gu et al., 2007a). Once activated, the Artemis:DNA-PKcs complex can not only act at 5' or 3' overhangs but can also nick at gaps; therefore, it can revise junctions that are ligated on only one strand (
Ma et al., 2005b).
In summary, the lymphoid-specific components that have been identified include RAG1, RAG2 and TdT, whereas HMGB1, Ku70/86, Artemis, DNA-PKcs, pol mu, pol lambda, XLF, XRCC4, and DNA ligase IV are present in all vertebrate somatic cells. Despite identification of these proteins, reconstitution using purified components has not been described. The closest approach has been to describe some level of coding joint formation in crude extracts (
Leu et al., 1997;
Ramsden et al., 1997;
Weis-Garcia et al., 1997). However, crude extracts can be misleading (especially for multi-step/multi-component processes) and led one group to infer a role for DNA ligase I in V(D)J recombination (
Ramsden et al., 1997), rather the genetically-proven ligase IV (
Grawunder et al., 1997;
Schar et al., 1997;
Teo and Jackson, 1997;
Wilson et al., 1997). Once components are genetically-identified, reconstitution is an important aspect in understanding any complex biochemical pathway (
Aboussekhra et al., 1995;
Kadyrov et al., 2006;
Klungland and Lindahl, 1997;
Kubota et al., 1996).
Here we have succeeded in achieving coding joint formation using the purified human proteins described above. The coding joints generated by this system show the junctional features seen for coding joints formed within lymphoid cells, including N-nucleotide addition due to TdT, P-nucleotide addition due to hairpin opening by Artemis:DNA-PKcs, and nucleolytic resection due to Artemis:DNA-PKcs. The coding joint formation is abolished in the absence of RAGs, Artemis, DNA-PKcs, or XRCC4:DNA ligase IV. Using additional biochemical approaches, we have documented that the Artemis:DNA-PKcs complex can indeed be activated by DNA hairpins, that this activation is caused by the hairpin to which the complex is bound (cis activation), and that this activation results in the nicking of the hairpins. Hence, the defined system permits generation of junctional and combinatorial diversification of the vertebrate immune system using purified human proteins.