Most organisms possess sophisticated defense mechanisms to protect them against the invasion of foreign agents such as viruses, bacteria, and parasites. Simple prokaryotes use a complementary system involving DNA methylation of the host genome and endonuclease degradation of foreign genomic material to differentiate self from non-self [
18]. Eukaryotes have developed more sophisticated systems to thwart off the invasion of foreign substances. Indeed, the mammalian immune system is arguably one of the most intricate and ingenious methods for actively seeking out and killing a wide variety of invaders.
The vertebrate immune system is divided into two subcategories, the innate and adaptive immune systems, that differ in their specificity. The innate immune system is generally considered to be less specific due to the promiscuous ability of the immune receptors to recognize a limited number of molecules that are common features to many infectious agents including polysaccharides, peptidoglycans, non-methylated CpG DNA, and double-stranded RNA [
19–
22]. This promiscuous activity allows the innate immune system to act as the first line of defense against infection by rapidly recognizing and responding to pathogens. If the defensive line of the innate system is breached, then a more specific and highly specialized offense system, the adaptive immune response, is mobilized to its full potential.
Adaptive immunity came into existence in vertebrates roughly 500 million years ago [
23]. The cells of the adaptive immune system, namely T- and B-cells, have a diverse repertoire of antigen receptors and antibodies that can recognize any antigen encountered throughout life [
24]. After the adaptive immune cell receptors bind an antigen, they mount a rapid and robust protective response by a dramatic expansion in the number of pathogen-specific T cells [
25–
28]. Over the course of one week, thousands of clones are produced that possess effector functions [
29–
31]. Approximately 95% of these activated T-cells undergo apoptosis [
30,
32]. However, a stable population of long-lived T cells resides in the lymphoid and non-lymphoid tissues [
33,
34] and patrol for these previously encountered pathogens. The immunological memory displayed by the adaptive immune system provides the vertebrate host with long-lasting protection against subsequent infection. For example, most individuals remain immune to measles for up to 75 years once exposed to an attenuated form of measles virus [
35].
At the molecular level, the cells of the immune system have developed a strategy to increase acquired immunity against subsequent biological assaults [
36] (). This process, commonly known as V(D)J recombination, plays an essential role in abrogating these antigens. Rearrangement of the variable (V), diversity (D) and joining (J) gene segments creates versatility to a competent immune system by generating a diverse repertoire of antigen receptors with unique antibody specificities [
37]. This transaction of breaking, rearranging, and rejoining of the V, D, and J regions of the germline immunoglobulin genes requires the collaborative efforts of the three distinct enzyme activities that include nucleases, polymerases, and ligases. Of the three major types of enzymatic activities, our understanding of how specific DNA polymerases function during V(D)J recombination is not yet firmly established. However, crucial information for understanding the role of specific polymerases in V(D)J recombination has started to emerge. The relative functions of the various members of the pol X family of DNA polymerases (TdT, pol μ, and pol λ) during the processing of DNA in V(D)J recombination are distinct [
38] and nonoverlapping [
9]
in vivo. Ramsden and colleagues have indicated that a “gradient” of weak to strong terminal deoxynucleotidyl transferase activity defines the distinct roles of pol λ, pol μ, and TdT in nonhomologous end joining (NHEJ), respectively [
38]. Moreover, Bertocci
et al have shown that pol μ participates exclusively in light chain and not in heavy chain gene rearrangement [
9,
39]. In contrast, pol λ is reported to be recruited only in the heavy chain junctions during V(D)J recombination and precedes the action of TdT [
9], which is primarily involved in the random addition of nucleotides to unpaired primer termini [
38]. While pol μ and pol λ play important roles in the immune system, the role of TdT in V(D)J rearrangement is showcased in this review. A more thorough description of pol β, pol μ and pol λ is provided in a chapter by Joann Sweasy and colleagues in this Special Issue.
illustrates the crucial steps of V(D)J recombination during the
RAG cleavage phase. This process is initiated by introducing a double strand break (DSB) at the edge of the selected gene segment by the
RAG-1 and
RAG-2 (Recombination-Activating Genes) proteins. These proteins selectively bind to specific and highly conserved recombination signal sequences (RSS) that contain heptamer and nonamer elements separated by a spacer region [
40,
41]. There are either 12 bp or 23 bp spacers between the heptamer and nonamer elements of the RSS commonly denoted 12-RSS or 23-RSS [
40–
43], respectively. Proper recombination demands that one 12-RSS and one 23-RSS be present for efficient cleavage
in vivo [
40,
41,
44], and this phenomenon is referred to as the “12/23-rule”. After recognition of complementary RSS, the
RAG complex introduces a nick between the D and J coding segment and the adjoining recombination signal sequence. The
RAG complex also mediates the formation of hairpins at each coding end by using the 3′-OH moiety at each nick as the nucleophile.
Double-strand DNA breaks introduced during the RAG cleavage phase are repaired during the non-homologous end-joining phase (). The nuclease, Artemis:DNA-PK
cs, can trim the 5′ and 3′ overhangs [
45] and open the hairpins at the coding ends generating palindromic nucleotide sequences (P-nucleotides). At this point, TdT randomly adds nucleotides to available 3′-OH ends increasing the variability of the non-templated nucleotide (N) region of the recombined gene segments [
46]. The details of this process are provided in . While
in vitro studies have shown that TdT can incorporate all four natural nucleotides on to single-stranded DNA (vide infra), there is a definitive bias for the incorporation of dGMP and dCMP versus dAMP or dTMP observed
in vivo [
8,
47–
50]. This preference may offer a plausible explanation for the high G/C content of the Ig and TCR N-regions [
48,
49]. In addition, the average N-nucleotide segment length created
in vivo is only 2–5 bp per coding joint [
47]. This length appears optimal for allowing the DNA strands to undergo microhomology alignment using Watson-Crick base pair recognition patterns. Unpaired nucleotides are trimmed by an exonuclease such as the Artemis:DNA PK
cs complex [
45]. The gaps are ultimately filled by template-dependent DNA polymerases, and the ligation of the coding ends is carried out by the XRCC4:DNA ligase IV complex [
45].
The role of TdT in generating immunological diversity is to catalyze the random addition of small numbers of nucleotides at the N regions. This activity has been speculated by virtue of its catalytic properties in addition to its localization to primary lymphoid tissues such as thymus and bone marrow [
15,
51,
52]. A wealth of information to support this hypothesis exists and include the correlation between the existence of N regions with TdT expression [
13] or mRNA expression [
53] and with the inclusion of the TdT gene into a cell line that continuously rearranges immunoglobulin genes but lacks the enzyme [
7,
8]. In addition, TdT-knockout mice and TdT-deficient lymphocytes [
6,
54] have also been used to validate the role of TdT in N-nucleotide addition during the V(D)J recombination process. As expected, TdT-knockout mice show a ten-fold reduction of T-cell receptor (TCR) diversity compared to the wild-type mice and provides evidence that the addition of N-nucleotides by TdT is of paramount value to establishing a combinatorially diverse antigen receptor repertoire [
54].