|Home | About | Journals | Submit | Contact Us | Français|
Retroelements are important evolutionary forces but can be deleterious if left uncontrolled. Members of the human APOBEC3 family of cytidine deaminases can inhibit a wide range of endogenous, as well as exogenous, retroelements. These enzymes are structurally organized in one or two domains comprising a zinc-coordinating motif. APOBEC3G contains two such domains, only the C terminal of which is endowed with editing activity, while its N-terminal counterpart binds RNA, promotes homo-oligomerization, and is necessary for packaging into human immunodeficiency virus type 1 (HIV-1) virions. Here, we performed a large-scale mutagenesis-based analysis of the APOBEC3G N terminus, testing mutants for (i) inhibition of vif-defective HIV-1 infection and Alu retrotransposition, (ii) RNA binding, and (iii) oligomerization. Furthermore, in the absence of structural information on this domain, we used homology modeling to examine the positions of functionally important residues and of residues found to be under positive selection by phylogenetic analyses of primate APOBEC3G genes. Our results reveal the importance of a predicted RNA binding dimerization interface both for packaging into HIV-1 virions and inhibition of both HIV-1 infection and Alu transposition. We further found that the HIV-1-blocking activity of APOBEC3G N-terminal mutants defective for packaging can be almost entirely rescued if their virion incorporation is forced by fusion with Vpr, indicating that the corresponding region of APOBEC3G plays little role in other aspects of its action against this pathogen. Interestingly, residues forming the APOBEC3G dimer interface are highly conserved, contrasting with the rapid evolution of two neighboring surface-exposed amino acid patches, one targeted by the Vif protein of primate lentiviruses and the other of yet-undefined function.
Retroelements have invaded the genomes of all organisms, from bacteria to primates, in amounts that often greatly exceed the sum of host protein-coding sequences. As such, they are essential evolutionary forces, but they must be tightly regulated to avoid deleterious consequences that would result from their uncontrolled spreading. The seven members of the human APOBEC3 family of cytidine deaminases partake in this process as innate immunity effectors that inhibit a large panel of retroelements, from endogenous retrotransposons to exogenous retroviruses (reviewed in references 14 and 22).
APOBEC3 proteins are characterized by the presence of one or two domains comprising the canonical zinc-coordinating motif (HXEX24-30PCX2-4C), essential for catalytic activity (33). APOBEC3G (A3G) contains two such domains, only the C terminal of which is endowed with editing potential (26, 43), whereas its N-terminal counterpart is enzymatically inactive but has RNA binding properties (32, 43). A3G can block the propagation of Vif-defective (Δvif) human immunodeficiency virus type 1 (HIV-1) (52) but is inactivated by the wild-type virus through Vif-induced proteasomal degradation (18, 40, 53, 55, 65). In the absence of Vif, the A3G antiviral effect requires both zinc-coordinating motifs, the N terminal for recruitment into particles and the C terminal for lethal editing of the viral DNA. A3G interacts via its N terminus with the nucleocapsid (NC) moiety of Gag, possibly through an RNA intermediate, and is thereby recruited at sites of viral assembly at the plasma membrane (4, 7, 56). Once the virus is in target cells, A3G interferes with reverse transcription at several steps (24, 25, 41), among which is the C-to-U editing of the nascent minus-strand viral DNA that critically depends on a functional C-terminal zinc-coordinating motif (27, 38, 66). It is noteworthy that the editing activity of A3G is not essential for its blockade of at least two other retroelements, the hepatitis B virus (58) and the Alu endogenous retrotransposon (16, 29).
In cells, A3G partakes in high-molecular-mass ribonucleoproteic complexes together with close to 100 other proteins (16, 21, 36). A3G binds single-stranded polynucleotides (32, 43) and has been found to associate with 7SL, Alu, and hYs RNAs (16, 62). A3G can also form oligomers (7, 10, 15, 33, 43), and the small angle of the X-ray scattering profile of these complexes is compatible with the volume of a homodimer (63). Mutating the N-terminal zinc-binding motif of A3G abrogates its RNA binding and self-association potential (43).
The recent structure determination of the A3G C-terminal domain has shed light on determinants dictating single-stranded DNA binding and editing (11, 28, 49). In contrast, the lack of structural data on its N-terminal counterpart, as well as the absence of enzymatic activity of this domain, have precluded reaching a similar understanding of the specific roles played by the zinc-binding motif and other features of this region of A3G. As a step toward investigating these issues, we conducted a large-scale mutagenesis/functional study of the A3G N-terminal module, mapping determinants crucial for RNA binding; oligomerization; and activity against an exogenous retrovirus, Δvif HIV-1, and an endogenous retroelement, Alu. We then integrated these data in a structural model. Our results reveal that residues important for the restriction of both retroelements form a surface predicted to mediate contacts between two A3G monomers and to form a groove that can potentially accommodate single-stranded polynucleotides. Furthermore, the results of phylogenetic analyses indicate that this region is highly conserved, in contrast with two adjacent patches of amino acids subjected to positive selection, one corresponding to the region targeted by Vif, the lentiviral antagonist of A3G, and the other of yet-unknown function.
HIV-1 viruses encoding all but the Vif protein were produced from the pR9Δvif plasmid (60). The plasmids required for the LINE-1-driven Alu retrotransposition assay were a kind gift from J. V. Moran (pALU_pA+_neoTET and pJM101_L1.RP_NoNeo) (19, 29). pCMV plasmids coding for APOBEC proteins fused to a C-terminal hemagglutinin (HA) tag are described in reference 34. Point mutations were introduced using Quikchange site-directed mutagenesis (Stratagene). Each mutant was verified by performing a sequencing reaction. The C-terminal segment 198 to 384 of A3G was amplified from pCMV-A3G-HA by PCR using primers A3G198-384_F, 5′ GCGTATCGATAAGCTTGGCCAAGGATGGATCCACCCACATTCAC 3′, and 009, 5′ GCACTGGAGTGGCAACTTCC 3′, and introduced back into pCMV-A3G-HA with HindIII and XbaI to replace the wild-type protein. The plasmid coding for C-terminally myc-tagged A3G under the cytomegalovirus promoter was obtained by D. Kabat through the AIDS Research and Reference Reagent Program (pCDNA3.1.A3G-myc-His). The Vpr construct with residues 1 to 14 deleted (Vpr14-88) was cloned from the wild-type HIV-1 plasmid pR9 (60) by PCR using primers Vpr_BF, 5′ CCGGGGGGAAGCTTGGCCACCATGCCATACAATGAATGGACA 3′, and Vpr_ER-pcs, 5′ CCGGGGGGAAGCTTTTCCACCTACGATCGGATAATTCTGGCTTACTCTCCTCTGTCGAGTAAC 3′, and a proofreading DNA polymerase (PfuTurbo, Stratagene). The Vpr_ER-pcs primer contains the HIV protease-dependent cleavage site SQNYPIV. The PCR fragment obtained was next introduced into the HindIII site of pCMV-A3G-HA to obtain plasmid pCMV-Vpr14-88-A3G-HA. The plasmid coding for Vif C133S is described in reference 34 (pRRLsin.cPPT.PGK.GFP-2A-Vif-myc).
HEK293T and CD4+, HIV-1 long terminal repeat-LacZ-containing, HeLa-derived P4.2 cells (9) were regularly passaged in Dulbecco's modified Eagle's medium supplemented with 10% fetal calf serum, 2 mM glutamine, and antibiotics (100 U/ml penicillin and 100 mg/ml streptomycin), while HeLa HA cells were passaged in minimum essential medium supplemented with nonessential amino acids (Gibco) in addition to 10% fetal calf serum, antibiotics, and 2 mM glutamine. HEK293T cells were transfected by using Fugene 6 (Roche) or Lipofectamine 2000 (Invitrogen), following the manufacturer's instructions. HeLa HA cells were transfected with Fugene 6 using a previously described protocol (29).
Viruses were produced in duplicates by transfection of HEK293T cells seeded in six-well plates, using the pR9Δvif plasmid, in the presence or of or without APOBEC proteins. To determine the activity levels of the mutants independently of their expression levels (see Fig. Fig.2C),2C), we produced Δvif HIV-1 viruses in the presence of wild-type A3G at three or four different doses and compared the restriction activity levels of the mutants to that of the corresponding dose of the wild type. At 12 h posttransfection, the medium was changed, and it was collected 24 h later. Cell-free virions were prepared by filtration through a 0.45-μM-pore-size nitrocellulose membrane (Spin-X; Corning) and concentrated in a microcentrifuge for 1.5 h at 13,000 rpm at 4°C. The pellet was resuspended in 20 μl of phosphate-buffered saline, and the amount of physical particles was monitored by reverse transcriptase activity assay (1). The infectious titer was determined by duplicate infections on the P4.2 indicator cell line (39). The specific titer was calculated by dividing the infectious titer by the amount of physical particles as determined by reverse transcriptase activity. The expression level of APOBEC proteins was evaluated by Western blot analysis of 2 μg of cell extract collected at the time of virion harvesting. To evaluate the amount of APOBEC protein inside viral particles, an equivalent of 1.5 × 106 cpm of virions was run on a sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) gel. APOBEC proteins were detected with a mouse HA-specific, peroxidase-conjugated 3F10 monoclonal antibody (Roche). The viral capsid was detected by using mouse p24-specific antibody (B. Chesebro and H. Chen through the AIDS Research and Reference Reagent Program).
The LINE-1-driven Alu retrotransposition assay was performed as previously described (29). Briefly, 3 × 105 to 5 × 105 HeLa HA cells were seeded into T75 flasks and transfected 15 h later with pALU_pA+_neoTET, pJM101_L1.RP_NoNeo, and pCMV5 or an APOBEC-encoding plasmid. Three days posttransfection, the cells were selected with G418 at 2 mg/ml for 2 days and then at 0.5 mg/ml for 4 to 5 more days. Colonies were then fixed with ethanol and stained with crystal violet and counted, and the counts averaged from duplicates. The expression levels of APOBEC proteins were evaluated by Western blot analysis of lysate from cells transfected in parallel and collected at the time of selection. Evaluation of the expression-independent activity of the mutants was performed by transfecting wild-type A3G at three different doses and comparing the activity levels of the mutants to that of the corresponding dose of the wild type.
Transfected HEK293T cells were lysed in radioimmunoprecipitation assay buffer supplemented with antiprotease cocktail set I (Calbiochem) and 100 U/ml of RNase inhibitor (RNAseIN; Promega). After a short spin to remove cell debris, a 50- to 100-μg amount of cell lysate was adjusted to 500 μl with lysis buffer. Ten microliters was retrieved and loaded on a SDS-PAGE gel, and 20 μl was used for reverse transcription-PCR (RT-PCR). Forty microliters of anti-HA affinity matrix (Roche) was added to the rest of the mix, which was swirled for 2 to 3 h at 4°C and then extensively washed using radioimmunoprecipitation assay buffer supplemented with NaCl at up to 500 mM. After the last wash, half of the beads were incubated in β-mercaptoethanol and SDS-containing buffer for 3 min at 95°C before being run on a SDS-PAGE gel. RNA was extracted from the other half and analyzed by RT-PCR. For myc-tagged A3G, immunoprecipitations were performed similarly, using anti-c-Myc affinity matrix (Santa Cruz), and the protein was detected by Western blotting with a mouse anti-c-Myc antibody (Roche).
RNA was extracted using an RNeasy mini kit (Qiagen). After DNase treatment (Sigma), RNA was reverse transcribed using random hexamers and Superscript III reverse transcriptase as recommended by the manufacturer (Invitrogen). For each sample, a control reaction lacking the enzyme was included to check for contaminating DNA. We amplified the resulting cDNAs by PCR using GoTaq green master mix (Promega) and specific primers for 7SL (7SL_BF2, 5′ TCCGCACTAAGTTCGGCATC 3′, and 7SL_ER2, 5′ AGTGGCTATTCACAGGCGCG 3′), hY1 (hY1_F2, 5′ CTGGTCCGAAGGTAGTGAG 3′, and hY1_R2, 5′ CTAGTCAAGTGCAGTAGTG 3′), hY3 (hY3_F2, 5′ GCTGGTCCGAGTGCAGTGGTGTTTAC 3′, and hY3_R2, 5′ AGGCTAGTCAAGTGAAGCAGTG 3′), and Alu (AluWBF, 5′ TCACGCCTGTAATCCCAGCA 3′, and AluR2, 5′ GATCTCGGCTCACTGCAAG 3′). The linearity of the PCR was verified by parallel amplification of serially diluted human genomic DNA.
The N-terminal domain of the ABC3G_HUMAN sequence (residues 12 to 194; UniProtKB  code Q9HC16) was defined as the target sequence, while the recently resolved human A3G C-terminal domain (28) (PDB  ID 3E1U) served as a template. The two A3G domains show a good sequence identity (36%). Sequence alignment was done with the align2d function of the MODELLER program (50), as well as with the CLUSTALW (13) and T-COFFEE (44) algorithms, and subsequently modified based on structural alignments with CHIMERA (47).
Homology models were built with the MODELLER program (50); 1,000 models were calculated and scored with the MODELLER objective function and the ANOLEA energy (42) using the default 5-residue-window averaging. ANOLEA estimates the folding free energy of each amino acid in a three-dimensional structure. The model with a MODELLER objective function of less than 1,100 and the best ANOLEA energy for the nonloop regions was chosen for further refinement. Residues 140 to 146 were treated with the MODELLER loop refinement routine because of their poor alignment with the template structure. Again, 1,000 models were calculated, and the best conformation according to its ANOLEA score was used in the refined A3G monomer model. A zinc ion was added to the pseudoactive site of the model before it was energy minimized with the CHARMM program (6) and the CHARMM22 all-atom force field (37). The minimization consisted of 100 steps of steepest descent. A harmonic restraint of 5 kcal/mol/Å2 toward their initial position was imposed on all heavy atoms except for the zinc site residues His65, Glu67, Cys97, and Cys100, which were restrained to display the same distances to the zinc ion as in the X-ray structure. Two copies of the final N-terminal model were structurally aligned to chains A and C of the APOBEC2 (A2) tetramer (48) (PDB ID 2NYT) in order to create a putative A3G head-to-head dimer structure. The side-chain conformations of residues 27 and 124 to 127 at the dimerization interface were reassigned using SCWRL3 (8) and minimized again with CHARMM in order to remove clashes. The FoldX program (23) was used to perform an in silico alanine scan by mutating each residue to an alanine and estimating the change of folding energy. Molecular visualization and structural alignments were done using CHIMERA (47).
To initiate a structure-function study of the human A3G N-terminal domain, we first individually mutated 55 residues in this region (Fig. (Fig.1A)1A) (67), concentrating on positions that are highly conserved among primate orthologues or subjected to positive selection during primate evolution (46, 51) and opting for conservative amino acid substitutions in most cases (5). Thirty-six of the resulting mutants exhibited wild-type steady-state levels of expression, while 19 appeared less stable (Fig. (Fig.1B).1B). We then tested these derivatives for their efficiency at blocking Δvif HIV-1 infection and Alu retrotransposition in single-round assays (Fig. (Fig.2).2). Although the magnitudes of the blockades obtained with wild-type A3G differed between the two systems, in both cases it was dose dependent, with the inactive A2 serving each time as a negative control (Fig. (Fig.2A).2A). Remarkably, all mutants exhibited parallel phenotypes for both classes of retroelements: of the well-expressed mutants, six (S28A, W94L, Y124A, Y125A, F126L, and W127L) exhibited marked decreases in Δvif HIV-1 and Alu restriction, whereas one, the D130K mutant, had instead a reproducible increase in inhibitory potency (Fig. (Fig.2B).2B). The W127A mutant, slightly more stable than the W127L mutant described previously (31), was also defective (not illustrated). In contrast, mutations at several positions subjected to positive selection during primate evolution (Tyr13, Arg29, Glu85, Lys99, Arg102, and Asp128) did not affect activity against Δvif HIV-1 or Alu (data not shown). The activity levels of the 19 less stable mutants were normalized for the expression levels. Eleven of them (F21L, R30E, E85Q, P96L, L108A, L116A, L123A, L138A, M152A, C160S, and F164L) then appeared as active as the wild type against both Δvif HIV-1 and Alu (not illustrated), while eight, including derivatives carrying changes in zinc-coordinating residues (H65R, E67Q, C97S, and C100S), were defective (Fig. (Fig.2C).2C). This result highlights the importance of an intact N-terminal zinc-coordinating motif for inhibiting both retroelements, contradicting a previous report that this motif is dispensable for Alu restriction (16). We also tested two C-terminal domain mutants, one at a position homologous to the N-terminal Trp94 (W285L) and the other with a change at a zinc-binding position (C288S). These two residues have been shown to contribute to the catalytic activity of recombinant C-terminal domain A3G (11, 12). Both mutants were defective for Δvif HIV-1 restriction but fully competent for Alu inhibition (Fig. (Fig.2D),2D), consistent with the previously noted importance of editing for blocking the former but not the latter (16, 29, 38).
Interaction with RNA seems to govern two properties of A3G, namely, binding to the HIV-1 NC, which leads to recruitment into nascent particles, a prerequisite for viral inactivation, and oligomerization (4, 15, 56). Furthermore, an intact N-terminal zinc-coordinating motif is required for the self-association of A3G, as well as RNA binding (43). Therefore, we probed the RNA binding ability of A3G N-terminal mutants by RT-PCR analysis of immunoprecipitates recovered from transiently transfected HEK293T cells (Fig. (Fig.3).3). Contrary to the results for A2, wild-type A3G was found to associate with hY1, hY3, 7SL, and Alu RNAs, as previously described (16, 62). In contrast, functionally defective N-terminal mutants had, to various degrees, a decreased affinity for one or more of these RNAs. Although the background level was relatively high for Alu (Fig. (Fig.3,3, lane marked “no RT”), interaction with this RNA was reduced to negligible levels for the two most inactive mutants, the W94L and W127A mutants. The D130K and W285L mutants did not show a significant change of affinity for any RNA tested.
To evaluate the oligomerization potential of the A3G mutants, we expressed them as HA-tagged derivatives together with myc-tagged wild-type A3G and performed HA- and myc-specific Western blot analyses of antimyc-mediated immunoprecipitations (Fig. (Fig.4).4). While wild-type HA-tagged A3G efficiently coimmunoprecipitated with its myc-tagged counterparts, N-terminal mutants were impaired for oligomerization to degrees largely corresponding to their RNA binding and antiviral activity defects (Fig. (Fig.4A,4A, left). In contrast, three mutations in the C-terminal domain of A3G (W285L, C288S, and Y315A), in spite of being at positions homologous to those in three N-terminal oligomerization-defective mutants (W94L, C97S, and Y124A) (45), had no impact on A3G self-association (Fig. (Fig.4A,4A, right). Neither APOBEC3A (A3A) nor a version of A3G with an N-terminal deletion (A3G198-384) (12) could bind wild-type A3G (Fig. (Fig.4B).4B). Interestingly, most of the A3G oligomerization determinants identified in this work have aromatic groups (Trp94, Tyr124, Tyr125, Phe126, and Trp127), most of which do not have counterparts in A3A and in the C-terminal domain of A3G. Unfortunately, A3G derivatives lacking the C-terminal domain were unstable and could not be tested for RNA binding or self-association.
Individual mutations of two aspartates to lysine at positions 128 and 130 of A3G impair recognition by HIV-1 Vif (31, 39). Owing to the close proximity of these residues to the patch at residues 124 to 127 that is involved in oligomerization, we investigated whether Vif could prevent A3G complex formation. For this, we performed the myc-/HA-tagged A3G coimmunoprecipitations in the presence or absence of HIV-1 Vif C133S, which binds A3G but fails to trigger its proteasomal degradation (35). As reported previously (35), Vif C133S did not affect the antiviral potency of A3G (data not shown). In line with this result, it did not interfere with A3G oligomerization, in spite of associating with the cytidine deaminase (Fig. (Fig.4C).4C). Of note, the mutants D128K and D130K that do not bind Vif had full oligomerization and RNA binding abilities (Fig. (Fig.33 and and4A),4A), and the oligomerization-defective A3G W94L and W127A mutants were efficiently degraded by wild-type Vif (data not shown) (31). Altogether, these data indicate that distinct A3G domains dictate oligomerization and recognition by HIV-1 Vif.
The incorporation of A3G into assembling HIV-1 particles is essential for its antiviral activity against this pathogen (43). As a remarkable correlate, the functional defects of well-expressed A3G N-terminal domain mutants with impaired RNA binding and antiviral activities were inversely proportional to their packaging efficiency (Fig. (Fig.5A),5A), in agreement with the RNA-dependent association of A3G to the HIV-1 NC (4, 56). This is notably exemplified by the S28A and S28E mutants, whose impact on Δvif HIV-1 infectivity closely paralleled their packaging efficiencies. In contrast, the C-terminal W285L mutant was efficiently encapsidated in spite of its restriction defect. To further probe the contribution of the packaging defect of A3G N-terminal mutants to their functional impairment, we forced their encapsidation by fusion with a fragment of the HIV-1 Vpr protein that was sufficient for recruitment into viral particles but devoid of cytotoxicity (amino acids 14 to 88) (2). A peptide linker that is sensitive to the viral protease was placed between the two parts of the resulting fusion proteins, allowing the release of the A3G moiety following virion maturation to avoid subsequent influence of Vpr on the A3G activity. Vpr-mediated particle incorporation significantly rescued the antiviral phenotype of the packaging-defective A3G mutants (Fig. (Fig.5B).5B). Although the activities of the least-defective mutants were not fully restored, the antiviral effects of the S28E, W94L, and W127A mutants increased approximately 10 times in this context. This indicates that N-terminal residues that are essential for A3G packaging do not subsequently play a major role in HIV-1 inhibition. In contrast, the W285L catalytic mutant remained inactive even when fused to Vpr.
A clear interpretation of our functional and biochemical data is complicated by the absence of any structural information on the N-terminal domain of A3G, due at least in part to the insolubility of the full-length protein at high concentration (11). It would be particularly interesting to determine whether determinants of restriction are part of a single domain or are dispersed in different areas. Taking advantage of the recently published crystal structure of the A3G C-terminal domain (residues 197 to 384) (28) and of the good sequence identity between the two domains (36%), we modeled the structure of the N-terminal domain (residues 13 to 194) using the MODELLER program (50). As the sequence identity between these structures is significantly higher than that with A2 (27%), the model presented herein may be considered more reliable than earlier A3G models based solely on A2 (30, 67).
Our model is shown in detail in Fig. Fig.6.6. It has a good ANOLEA score profile (data not shown), with a pattern comparable to that of the X-ray structure of the C-terminal domain, indicating good reliability of the predicted structure. Like the C-terminal domain, the N-terminal region of A3G forms a central anti-parallel β-sheet surrounded by α-helices. The positions of the loops delimited by residues 22 to 34 and 56 to 64 differ significantly and the regions containing residues 123 to 134 and 140 to 146 differ to a lesser extent from models based on A2. In our model, the N-terminal residues found to affect the expression of A3G upon mutation (Fig. (Fig.1B)1B) are either linked to the zinc ion (His65, Glu67, Cys97, and Cys100) or have an important structural function (Fig. (Fig.6A),6A), as suggested by an in silico alanine scan of the model (see Table S1 in the supplemental material). Out of those mutants, the ones that are defective regardless of their expression level (Fig. (Fig.2C)2C) are modified at positions involved in zinc binding (His65, Glu67, Cys97, and Cys100) or seem to play a prominent role in stabilizing the structure of the protein (Phe70, Tyr91, and Ile118) (Fig. (Fig.6A;6A; also see Table S1 in the supplemental material). Although Arg122 was not predicted to influence the overall structure of A3G, its high degree of conservation among cytidine deaminases, including A2, correlates with its important functional role. Since zinc often stabilizes protein domains (57), the disruption of this motif in the A3G N terminus might alter function indirectly, by altering local conformation. For A2, a single-domain cytidine deaminase, two monomers assemble in a long anti-parallel β-sheet with one zinc-coordinating domain at each extremity (48). Two of those dimers interact to form a linear tetramer. Since most of the residues important for A3G self-association resided in regions homologous to the ones mediating A2 tetramer formation and since various experiments have suggested that A3G could form, among different heterogeneous complexes, a homodimer (7, 10, 15, 33, 43, 63), we built a structural model of an A3G head-to-head dimer by structural alignment of two A3G N-terminal domains to two A2 monomers (Fig. (Fig.6B).6B). In this model, the residues found to be important for A3G functionality (Ser28, Trp94, Tyr124, Tyr125, Phe126, and Trp127) partake in stabilizing the dimer, notably, by forming numerous π-π interactions between the five aromatic residues and their cognate copies from the other monomer (Fig. (Fig.6C).6C). Trp127 makes extensive contacts with Trp94 and Tyr124 of the other monomer. In agreement, the W94L and W127A mutants were the two mutants with the weakest binding affinity for wild-type A3G (Fig. (Fig.5A).5A). Although they do not make direct contacts with Ser28 on the other monomer, Asp128 and -130 are in close proximity to it, an observation that parallels the observation of a reduced capacity of S28E, in comparison to that of S28A, to oligomerize with wild-type A3G (Fig. (Fig.5A).5A). Two additional residues participating in intersubunit contacts on A2 (48) are present on a helix corresponding to the most C-terminal helix in our model. Nine mutations in this helix (F157L, N176D, K180A, Y181A, I183A, L184A, I187A, E191Q, and I192A) did not affect A3G self-association in coimmunoprecipitation experiments (data not shown). Although this helix is close to the putative dimer interface, it displays less-extensive interactions with the other monomer than the loop containing residues 124 to 127 or the corresponding helix in the A2 tetramer, in agreement with the results of our mutational analysis.
The dimer interface forms a groove with a width of approximately 10 Å (Fig. (Fig.6D,6D, side view), compatible with the diameter of single-stranded but not double-stranded polynucleotidic chains, in agreement with in vitro data (64). The aromatic residues Trp94 and Tyr124 to Trp127 located at the bottom could interact with the nucleotide bases. The corners of the groove are delimited by two negatively charged residues, Asp128 and Asp130, which do not have extensive interactions with the other monomer, in agreement with our coimmunoprecipitation data (Fig. (Fig.5A).5A). Primate A3Gs have an unusually high rate of residues under positive selection, characteristic of rapid evolution (46, 51). Mapping these residues onto the model reveals that the large majority occur on one side of the dimeric protein, leaving the other side with only a few such residues (Fig. (Fig.6D,6D, top view and bottom view, respectively). At the dimer interface, a patch of amino acids subjected to positive selection is centered on the Vif-binding site, previously demonstrated to comprise residues 128 to 130 (Fig. (Fig.6D,6D, side view) (31, 67). Alignment of Old World monkey and hominid A3Gs shows that conserved residues overlap with secondary structure elements, further supporting the validity of our model (Fig. (Fig.7).7). In the loop domains, the three largest clusters of conserved residues encompass one or more of the six amino acids identified as important for restriction (Ser28, Trp94, Tyr124, Tyr125, Phe126, and Trp127).
In the present work, we combined functional analyses and structural modeling to delineate a structural region of the A3G N terminus that is involved in mediating oligomerization and RNA binding and is essential for A3G packaging into HIV-1 virions, inhibition of Δvif HIV-1 infectivity, and blockade of Alu retrotransposition. This structural region notably encompasses a series of aromatic residues, Trp94, Tyr124, Tyr125, Phe126, and Trp127, that are predicted to mediate intersubunit contacts through a series of π-π interactions, as is often found at protein-protein interfaces (17). The dimer interface is further characterized by the formation of a putative groove that has the width and electrostatic potential to bind single-stranded but not double-stranded polynucleotides. Although our data do not formally demonstrate the presence of a dimer rather than a higher-order structure, recent evidence indicates that the major form of oligomer detected in experimental settings comparable to ours is a dimer (30). In that report, Huthoff et al. similarly proposed an A3G head-to-head dimer model, mainly based on analysis of mutants with the two N-terminal mutations Y124A and W127A and homology with the structure of A2. The two models show striking similarities, notably revealing the importance of residues 124 to 127 in establishing intermonomeric contacts. However, interesting differences emerge between the two models, probably stemming from the use here of the now-available structure of the more closely related A3G C terminus as a template, while that of the more distant A2 was utilized by Huthoff et al. For instance, their study did not predict the putative RNA-accommodating groove created at the dimer interface, due to a different positioning of the loop containing residues 22 to 34 which buried the zinc-coordinating motif and placed Ser28 far from the other monomer. Also, a more twisted dimer positioned Asp128 at the dimerization interface, resulting in contacts with Arg122 of the other monomer, a possibility difficult to reconcile with the absence of both any oligomerization phenotype of the D128K mutant and the accessibility of this residue to Vif (Fig. (Fig.4A).4A). Finally, Huthoff et al. found that positively charged amino acids located at the edge of the dimer interface (Arg24, Arg30, and Arg136) might engage in stabilizing interactions with RNA. However, Arg24 (54) and Arg30 (Fig. (Fig.1B;1B; also see Table S1 in the supplemental material) were both found to contribute to protein stability, suggesting that these residues influence RNA binding only indirectly. In addition, we observed that charge-inverting mutations at four other positions on the same interface (Arg29, Lys63, Lys99, and Arg102) did not reduce A3G restriction activity or RNA binding (not illustrated).
Our functional analyses confirm that the C-terminal catalytic site of A3G promotes Δvif HIV-1 but not Alu inhibition. In contrast, they indicate that A3G requires an intact dimerization domain to act against both classes of retroelements. In the case of HIV-1, however, the implicated residues seem necessary mainly for gaining access to budding viral particles. Indeed, the defective phenotype of all derivatives impaired for packaging due to mutations at these positions could be rescued by forcing their incorporation into virions by fusion with Vpr (Fig. (Fig.5B).5B). Thus, the N-terminal module of A3G does not appear to play a major role in other aspects of the antiviral effect of the cytidine deaminase on HIV-1. In contrast, in the absence of a clearer understanding of how A3G blocks Alu retrotransposition, the role of the N-terminal domain in this process can only remain speculative. Some degree of correlation between Alu RNA binding ability and the Alu inhibition potential of the mutants studied here (Fig. (Fig.3)3) suggests a model whereby A3G blocks retrotransposition of this element by sequestering its RNA away from the LINE trans-acting proteins. However, this simple model is not supported by data obtained with A3A, since in the experimental system described here, this cytidine deaminase exhibited far greater levels of anti-Alu activity than A3G but no obvious affinity for its RNA, at least based on the results of RT-PCR analysis of A3A-specific immunoprecipitates (not illustrated).
The potential RNA-accommodating groove formed by the dimer interface (Fig. (Fig.6D)6D) is consistent with the RNase sensitivity of A3G oligomers (45). Furthermore, the fact that mutations abrogating dimerization and perturbing association with small cellular RNAs also prevented packaging into HIV-1 virions indicates that these three events are linked. The RNase sensitivity of the A3G-HIV-1 NC interaction was previously interpreted as indicating that an RNA intermediate is involved in linking the two partners (4, 56). Alternatively, it could be that NC does not recognize A3G as a monomer but as a dimer, the stabilization of which is RNA dependent. However, the apparent promiscuity of A3G packaging, which can be recruited into HIV-1 virions via either the cognate retroviral NC or its severe acute respiratory syndrome coronavirus homologue (61), makes a protein-protein-only interaction less likely, unless it is primarily structural motifs rather than specific amino acid sequences that are at play.
Several studies have pointed to the role of Asp128 and Asp130 in binding Vif (31, 39). A recent report indicated that Thr32 might be involved as well (54). On the A3G monomer, Asp128/Asp130 and Thr32 are located on opposite sides of the dimerization-mediating aromatic stretch. However, in our dimer model, Thr32 is situated close to Asp128 of the adjacent monomer (Fig. (Fig.6D,6D, side view). Together with the results of coimmunoprecipitation studies indicating that Vif does not prevent A3G oligomerization (Fig. (Fig.4C),4C), this strongly suggests that Vif can recognize a cytidine deaminase dimer. From a therapeutic perspective, since Vif binding and A3G function involve distinct regions, possible side effects of a Vif-targeting drug on A3G activity may be limited.
Considering that A3G dimerization and RNA binding are important for inhibiting retroelements as distant as HIV-1 and Alu, it is not surprising that the residues involved in this process are highly conserved (Fig. (Fig.7).7). Notably, the subjection of residues located at the corner of the dimer interface to positive selection (Fig. (Fig.6D,6D, side view) (46, 51) is consistent with their recognition by the A3G antagonist Vif (31, 67). In contrast, there is no obvious explanation for the finding that another group of residues lined up on one side of the protein (Fig. (Fig.6D,6D, top view versus bottom view) similarly underwent rapid evolution. While it is tempting to postulate that these positions might be important for target recognition, the normal levels of activity of the corresponding mutants against both HIV-1 and Alu suggest that these elements at least were not responsible for the selective pressure exerted at these positions, leaving the nature of the underlying evolutionary forces undefined.
This work was supported by grants from the Strauss Foundation and the Infectigen Association to D.T. and the Swiss National Science Foundation to D.T. and O.M.
Special thanks to Nadine Zangger for sequence alignments and Sujana Nylakonda and Nicola MacKinnon for technical assistance. The following reagents were obtained from the NIH AIDS Research and Reference Reagent Program, Division of AIDS, NIAID: pCDNA3.1.A3G-myc-His from David Kabat and mouse p24-specific antibody from Bruce Chesebro and Hardy Chen. For computational resources, we thank the Vital-IT team at the Swiss Institute of Bioinformatics. The modeling part of this work was performed within the Protein Modeling Facility (PMF) of the University of Lausanne. Molecular graphics images were produced using the UCSF Chimera package from the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco.
Published ahead of print on 23 September 2009.
†Supplemental material for this article may be found at http://jvi.asm.org/.