The immature virion is a roughly spherical shell of radially extended Gag molecules. The N-terminal Gag MA domains are bound to the inner viral membrane, and the C-termini of the Gag molecules project into the center of the virus ( and ). To a first approximation, all of the information necessary for retroviral particle assembly resides in the Gag polypeptide. For example, Gag alone can form extracellular virus-like particles in the absence of other viral proteins [
11] and Gag molecules can spontaneously assemble into spherical, immature virus-like particles
in vitro [
12-
14]. Nevertheless, although Gag itself encodes the necessary tertiary and quaternary interactions, it must be emphasized that assembly requires nonspecific RNA interactions both
in vivo and
in vitro, and is assisted by host factors
in vivo, including trafficking factors, assembly chaperones, and the ESCRT budding pathway, as reviewed elsewhere [
15-
18].
Although it was appealing to imagine that the Gag molecules were organized with icosahedral symmetry, analyses of immature virions of HIV-1 [
19-
21], Rous sarcoma virus (RSV) [
22] and murine leukemia virus (MLV) [
23] revealed non-icosahedral paracrystalline lattices composed of hexameric Gag rings. The rings are close-packed, with characteristic inter-ring spacings of ~80 Å. The immature virions of HIV-1 particles are often discontinuous, implying that the Gag lattice need not close fully during virus budding and that the membrane also plays an important role in stabilizing immature particles ([
24], H.-G. Kräusslich, pers. comm.). The different regions of Gag must, of course, work together in concert to drive particle assembly, and the emerging picture is that many different interactions collaborate to create viral particles. Indeed, the requirement for multiple weak, but highly cooperative interactions probably provides a mechanism for ensuring that particle formation only occurs when all essential aspects of assembly such as membrane binding and RNA packaging have been met. Thus, while different regions of Gag contribute to membrane binding, Gag-Gag interactions and cofactor recruitment, we wish to emphasize that the Gag polypeptide is an assembly machine that simultaneously performs
and integrates all of these different activities.
Membrane binding is mediated primarily by the MA domain
MA binds directly to the inner leaflet of the plasma membrane, and recent studies have helped elucidate how this domain targets Gag to the plasma membrane and triggers particle assembly at the correct time and place () [
16]. HIV-1 MA (colored red in and ) is a helical domain that displays a conserved patch of basic residues on the same face as the N-terminal myristoyl modification. C-terminal MA residues form an extended α-helix that projects away from the globular domain, on the side opposite the membrane binding face [
25,
26]. Membrane binding is mediated by insertion of the myristoyl group into the lipid bilayer and by the basic patch, which binds acidic phospholipids, particularly PI(4,5)P
2, a phosphoinosotide that is concentrated in the plasma membrane () [
27-
31]. Importantly, HIV-1 Gag is incorrectly directed to internal membranes upon depletion of cellular PI(4,5)P
2, indicating that this phospholipid is required for proper targeting
in vivo [
28]. Binding studies indicate that ionic interactions dominate the energetics of HIV-1 MA membrane binding
in vitro and that the myristoyl group makes only modest contributions to the binding interaction [
32]. Indeed, some retroviral MA proteins lack myristoyl (or similar) modifications altogether, and in those cases, membrane binding is mediated exclusively by ionic interactions [
33,
34]. Cholesterol also plays an important role in Gag targeting and particle assembly [
35], but the molecular mechanism by which Gag senses cholesterol is not yet known.
MA membrane binding appears to be positively coupled to Gag oligomerization and assembly in several different ways. Locally, the MA domain exhibits a two-state, “myristoyl switching” mechanism in which the myristoyl group is sequestered in a groove along the body of the globular domain in the soluble form of the protein and is then exposed for insertion into the bilayer [
36-
39]. Myristoyl exposure is favored by PI(4,5)P
2 binding, providing a mechanism for coupling plasma membrane recognition and membrane binding [
40]. Stable membrane binding and myristoyl exposure are also favored by Gag oligomerization, [
32,
37,
41,
42], and inositol phosphate binding promotes trimerization of Gag fragments even in the absence of the myristoyl moiety [
43]. Thus, downstream Gag-Gag interactions contribute to the affinity of MA-membrane interactions (and vice versa). The MA domain may also negatively regulate Gag assembly, probably by “folding back” onto the NC region, providing yet another potential mechanism for coupling membrane binding with the formation of the lateral Gag-Gag interactions essential for immature virion assembly [
43,
44].
Gag-Gag lattice interactions in the immature virion are mediated primarily by CA and SP1
The immature virion is stabilized by lateral interactions distributed throughout the Gag polypeptide, and the historical view of a discrete interaction site (or “I” domain) is therefore an oversimplification. Nevertheless, specific Gag regions are particularly important for immature particle formation, whereas other regions, such as MA and p6, are largely dispensable. Although MA couples membrane binding and assembly, it does not form a continuous regular lattice in the immature virion [
22] and is dispensable for particle formation [
45]. Similarly, p6 recruits the cellular machinery required for virus budding [
46], but does not make important Gag-Gag contacts. In contrast, critical contacts are made by the C-terminal domain of CA (CA
CTD), the adjacent SP1 spacer, the NC region, and, to a lesser extent, the N-terminal domain of CA (CA
NTD) [
42,
47-
52].
A recent electron cryotomographic analysis of immature HIV-1 virions confirmed the radial “beads on a string” arrangement of the different Gag domains and showed that the CA and SP1 regions formed a close-packed lattice of cup-shaped hexamers, in which the walls and bottom of the cups appeared to correspond to the CA and SP1 layers, respectively () [
24]. In contrast, the MA and NC layers lacked hexagonal order. Although the resolution was insufficient to position domains unambiguously, this reconstruction can now be used to guide possible models for Gag-Gag interactions, as constrained by relevant biochemical and high resolution structural data.
High resolution structures show that the free MA and CA
NTD domains are connected by a flexible linker that could easily span the distance between the MA and CA
NTD layers of the immature virion [
53,
54]. The MA domain does not change structure when tethered to CA
NTD, consistent with the idea that the membrane-binding “heads” of MA are connected via flexible linkers to the CA
NTD hexamers below. The oligomeric state of these membrane-bound MA proteins has not yet been established, however, since the matrix layer does not form a continuous lattice [
22], and MA proteins can form both trimers [
25,
40,
43,
55] and hexamers [
56]
in vitro. Interestingly, the MA and CA domains of RSV Gag are separated by an extended linker, called p10, which contributes to Gag assembly by forming an α-helical bundle with the neighboring subunit in the Gag hexamer ([
57], V. Vogt, pers. comm.).
The CA
NTD domain (colored dark green in -) is composed of seven α-helices packed in the shape of an arrowhead (CA helices 1-7), with an extended loop connecting helices 4 and 5 that binds the prolyl isomerase, cyclophilin A. CA
NTD interfaces within the immature lattice are still a matter of speculation, but biochemical studies have identified two regions that are important for immature virion formation: one encompassing helices 1 and 2, and a second surrounding helices 4 and 7 [
42,
47,
58,
59]. These observations are consistent with models in which the hexameric rings of CA
NTD in the immature lattice are stabilized by six-fold symmetric interactions involving helices 1 and 2, and contact(s) across the local two- or three-fold axes between neighboring hexamers involving helices 4 and 7 [
24]. Indeed, it was recently proposed that similar interhexamer contacts seen in two different MLV CA
NTD crystal forms might mimic the three-fold symmetric contacts between Gag hexamers in the immature lattice, and this proposal was supported by mutational analyses [
60,
61]. Nevertheless, further studies will be required to confirm these models and provide molecular details of the intra- and interhexamer contacts in the immature HIV-1 Gag lattice. In this regard, considerable insight will likely be gleaned from higher resolution structural studies of immature HIV-1, MLV, and MPMV (Mazon-Pfizer monkey virus) Gag assemblies formed
in vitro [
62-
65], and from deuterium-exchange protection experiments that map protein-protein interfaces in the immature lattice (P. Prevelige, pers. comm.).
The CA
CTD is also critical for immature particle assembly, and it appears to make both inter- and intrahexamer contacts in the Gag lattice [
24,
50-
52,
66,
67]. The globular CA
CTD domain (colored blue-green in -) is composed of a short 3
10 helix followed by an extended strand and four α-helices (CA helices 8-11) [
68,
69]. The strand/turn/helix 8 element is termed the major homology region (MHR) because its sequence is highly conserved across most retroviruses and retrotransposons [
2,
70]. Mutations in conserved MHR residues inhibit the assembly of immature particles, indicating that the MHR plays an important role in Gag lattice formation [
47,
50,
70,
71].
HIV-1 CA dimerizes in solution, and CA
CTD constructs have been crystallized as two distinct, but related dimers. One dimer is a side-by-side complex that is stabilized primarily by parallel hydrophobic packing of helix 9 against a symmetry mate () [
68,
69]. The RSV CA
CTD forms a similar dimer at low pH, albeit with a more hydrophilic interface and a somewhat different interdomain angle (R.L. Kingston, pers. comm.). The second HIV-1 CA
CTD dimer is a “domain-swapped” analog in which the MHR segments from each monomer associate to create a more extended dimer interface () [
72]. In this case, the structure was determined for a CA
CTD mutant that lacked a single amino acid residue in the helix 8/9 linker, which favored the domain-swapped conformation. Mutations expected to disrupt intermolecular helix 9 packing interfaces seen in both types of dimer reduced Gag dimerization (and trimerization)
in vitro, and inhibited the production of immature particles, supporting the biological relevance of the interface(s) [
42,
43,
47]. Possible roles for the two different CA
CTD dimers are discussed in greater detail below.
Residues located at the C-terminal end of CA and at the N-terminal end of the SP1 spacer are also essential for immature particle formation [
48,
73]. In the tomographic reconstruction of frozen-hydrated immature particles, density ascribed to the CA
CTD-SP1 region was located immediately beneath the central pore of each hexamer (i.e., the bottom of the cup), suggesting that this region stabilizes the hexamer. NMR studies of isolated Gag fragments indicate that the helical propensity for the CA-SP1 junction is weak [
74], but sequence analyses and mutational studies support the idea that this region may adopt an ordered helical structure in assembled virions [
73,
75,
76]. The CA-SP1 junction was therefore modelled as a six-helix bundle in the tomographic study (). MLV Gag, which lacks a spacer peptide between its CA and NC domains, may contain an equivalent region at the C-terminus of the CA domain, termed the “charged assembly helix”, which is also required for immature particle assembly [
77].
Finally, the NC region of Gag (colored blue in ) forms the innermost layer of the immature virion and also plays essential roles in particle formation [
21,
23,
51,
78-
81]. NC/RNA complex(es) do not follow the hexagonal symmetry of the CA and SP1 regions, and most experiments indicate that NC primarily “tethers” Gag molecules together, probably via RNA bridges, although additional NC-NC interactions may also occur [
79,
80,
82,
83]. NC/RNA tethers presumably increase the effective concentration of assembling Gag molecules, which could also play a more active role in orienting or otherwise facilitating CA-CA interactions.
While NC can bind RNA nonspecifically, packaging of the genome requires specific recognition of the dimeric, full-length viral RNA transcript. HIV-1 genome packaging requires an RNA element located near the 5′ end of the
gag gene (termed “Ψ”), which is composed of four stem-loops that can dimerize through self-complementary base pairing interactions of loop 1 residues () (reviewed in [
84]). Structures of NC in complex with two of the isolated stem loops from HIV-1 Ψ show that the CCHC zinc fingers of NC make sequence-specific, but remarkably different, contacts with exposed loop residues () [
85,
86]. The NC/RNA interactions visualized in these structures undoubtedly contribute to the specificity of genomic RNA packaging, although the full mechanism by which HIV-1 Gag packages dimeric RNA genomes has yet to be delineated. Even greater progress on the mechanism of genomic RNA packaging has been made for RSV and MLV, where complexes of NC with minimal Ψ RNA elements are now available [
87,
88]. In the MLV case, the high-affinity NC binding site is initially sequestered by base pairing in the monomeric form of the viral RNA, and then becomes exposed in the dimeric RNA, providing an elegant mechanism for coupling genome dimerization and encapsidation [
87].
As is clear from the above discussion, structural studies of the immature virion are still at an early stage, and important future goals include: (1) determining high resolution structures of Gag in its soluble dimeric and trimeric conformations [
43], (2) extending the resolution of models for the assembled hexagonal Gag lattice, and (3) understanding the mechanism by which the Gag lattice curves into a spherical assembly (e.g., through the inclusion of pentameric declinations and/or through irregular juxtapositions of hexagonal sheets). These are challenging problems, as retroviruses are irregular, yet highly organized assemblies, and progress will therefore continue to require the coordinated application of multiple different structural methods.