Viruses in the family Retroviridae,
e.g. human immunodeficiency virus (HIV), simian immunodeficiency virus, avian sarcoma virus, and equine infectious anemia virus (EIAV),
3 utilize reverse transcription to replicate the genetic information stored in their RNA genome (
1). Mutational analyses indicate that the viral capsid plays a critical role in this event because even small structural changes can disrupt the process (
2–
8). A retrovirion cycles through morphological transitions: viral particle assembly, budding, and maturation occurs in producer cells; the particle is disassembled in the newly infected target cell. At the onset of HIV-1 assembly, the structural precursor polyprotein, Gag, assembles underneath the plasma membrane (
9–
11). During or after budding, maturational changes proceed wherein the full-length Gag proteins in the assemblages are cleaved by viral-encoded protease (PR) to generate three major products, matrix (MA), capsid (CA), and nucleocapsid (NC). These are arranged to comprise a MA shell underlying the lipid envelope and a conical (
e.g. HIV-1, simian immunodeficiency virus, EIAV,
i.e. lentivirus subgroup) or spherical (
e.g. avian sarcoma virus) CA core that encapsulates the genome, which is bound to NC (
12–
14). For all retroviruses the N-terminal 12–13 residues of CA, unstructured within Gag, refold into a β-hairpin structural motif stabilized by a Pro
1-Asp
51 (HIV-1) salt bridge (
2). The salt bridge involving the conserved residue proline 1 is essential and any mutations affecting salt bridge stability will unfold the β-hairpin and cause defects in the assembled capsid that eventually result in noninfectious virions (
2–
7). Correct assemblage of native CA proteins during viral maturation ensures protection and integrity of the packaged viral genome. After virion entry into another host cell, the capsid disassembles to allow its genomic RNA to be released and reverse transcribed. This step requires the structural integrity and correct assembly of CA proteins (
15,
16).
Successful viral maturation therefore depends on folding of the β-hairpin motif and the reversible quaternary packing of CA proteins. A retroviral CA protein consists of two helical domains, the 17-kDa N-terminal (CA
N) and the 9-kDa C-terminal domains (CA
C). For HIV-1, the wild-type (wt) CA protein exists in solution mainly as dimers and some minor amount of other oligomers (
17–
21). High-resolution structures have been obtained for the individual retroviral CA domains (
22–
29) and the monomeric full-length CA (
30–
33). Interestingly, the sizes, secondary, and tertiary structures of different CA proteins are highly conserved (
34). The hexagonal and pentagonal assembly (
35,
36) of the full-length CA through CA
N domains was revealed by cryoelectron microscopy (EM) and crystallography (
37–
42). Further dimer or trimer formation of CA
C domains connects hexamers and pentamers (
38,
41). Unexpectedly, the β-hairpin motif does not form any inter-molecular contacts within high resolution CA quaternary structures.
To understand the contribution of the β-hairpin motif to CA assembly, we applied solution NMR spectroscopy to study the CA protein from EIAV, a lentivirus sharing the similar conical capsid core (
12) and assembly (
43) structure as HIV-1. Different from HIV-1-CA, EIAV-CA dimerizes much weaker in solution, which results in sharper NMR resonance peaks and is more suitable for study by solution NMR. Previous results on HIV-1-CA showed that any point mutation, deletion mutation, or N-terminal extension at Pro
1 would affect the Pro
1-Asp
51 salt bridge and unfold the β-hairpin (
2). The published crystal structure of EIAV-CA is an N-terminal 3-residue (Pro
1-Met
3) deletion mutant with an unfolded signature β-hairpin (
44). Instead of using this deletion mutant, which lacks resonances from residues Pro
1 to Met
3, we studied a N-terminal histidine-tagged variant that could also unfold the β-hairpin structure, in addition to the wt full-length EIAV-CA, to identify the structural differences associated with the β-hairpin formation. Our results showed that the maturational refolded β-hairpin induces a coil-helix transition in residues N-terminal to helix α1, Pro
17-Gly
19. The same region was also identified to be the expanded interface for the sparse CA
N domain oligomerization. The Thr
16-Gly
19 sequence is highly conserved in HIV-1-CA as well, and Arg
18 of HIV-1-CA has been demonstrated to be essential (
45) and form the inner-core of the CA
N hexamer (
38). Therefore we propose the function of refolding the β-hairpin, the prerequisite event for retroviral CA assembly, is to extend helix α1 at the N terminus to enhance CA
N oligomerization for assembly. Our data further suggested the CA
N oligomerization is kinetically slower and uncorrelated to the faster domain CA
C dimerization, previously not known in retroviral capsid assembly events (
46).