The retroviral lifecycle depends on insertion of viral DNA into a host cell chromosome, and integrase (IN) is the enzyme that orchestrates the key catalytic events involved in this process (reviewed in
Craigie, 2002;
Lewinski and Bushman, 2005). IN is responsible first for 3′-processing, the reaction in which two or three nucleotides are removed from one or both 3′-viral DNA ends, leaving 3′-hydroxyl groups. IN subsequently catalyses strand transfer, wherein it uses the 3′-hydroxyls to attack a pair of phosphodiester bonds in host cell DNA.
Retroviral INs contain three canonical domains: a zinc-binding N-terminal domain, a catalytic core domain harbouring the active site and a C-terminal domain (
Engelman and Craigie, 1992;
Kulkosky et al, 1992;
Hare et al, 2010a). The IN active site contains a triad of invariant acidic residues, referred to as the D,DX
35E motif (
Engelman and Craigie, 1992;
Kulkosky et al, 1992;
Dyda et al, 1994). The carboxylates coordinate a pair of divalent metal cations, essential for both enzymatic activities. Abundant
in vivo, Mg
2+ is considered to be the natural cofactor, although Mn
2+ fully supports IN function
in vitro (
Bushman and Craigie, 1991;
Engelman and Craigie, 1995;
Andrake et al, 2009;
Hare et al, 2010a). Both reactions catalysed by the IN active site proceed via bimolecular nucleophilic substitution (S
N2), shared by metal-dependent nucleotidyl transferases and some nucleases, including transposases and RNase H enzymes (
Engelman et al, 1991;
Mizuuchi and Adzuma, 1991;
Davies et al, 2000;
Kennedy et al, 2000;
Nowotny and Yang, 2006). The metal ion cofactors are thought to play dual roles during catalysis. Owing to the preferred octahedral geometry of the Mg
2+ and Mn
2+ coordination spheres (
Harding, 2006), the ions initially help to select and position the reacting groups. Second, they help to destabilize the scissile phosphodiester and promote formation of the phosphorane intermediate (
Nowotny and Yang, 2006;
Yang et al, 2006).
Retroviral integration shares a common set of intermediates with many DNA transposition systems () (
Li et al, 2006;
Li and Craigie, 2009). Initially, a tetramer of IN assembles on the viral DNA ends, forming the intasome. Following 3′-processing, the intasome binds target DNA, in a transient target capture complex (TCC). Following strand transfer, the post-catalytic strand transfer complex (STC) likely requires energy-dependent disassembly prior to 5′-end joining by the host DNA repair machinery. In the context of this work it is helpful to discriminate between two forms of the intasome: its initial state containing unprocessed (blunt) viral DNA ends and its post-3′-processing state. Herein, we refer to these complexes as uncleaved and cleaved intasomes (UI and CI), respectively. The intasome in its post-3′-processing state (CI), TCC and STC were recently structurally characterized using the IN from prototype foamy virus (PFV) (reviewed in
Cherepanov et al, 2011). The CI comprises a dimer-of-dimers of IN, wherein the central pair of IN subunits form a network of protein–protein and protein–DNA interactions and engage the processed 3′-viral DNA ends within their active sites (
Hare et al, 2010a). All three canonical domains of the inner IN subunits participate in protein–protein and protein–DNA interactions. The outer IN chains interact with the inner chains via the catalytic core domains. In the TCC, target DNA binds in a groove formed between the inner IN subunits in a severely bent conformation, forced to yield its target phosphodiesters to the IN active sites (
Maertens et al, 2010).
HIV-1 IN is a validated drug target with raltegravir currently being used for treatment of AIDS, and several related inhibitors are in clinical trials (
Summa et al, 2008) (reviewed in
Marchand et al, 2009;
McColl and Chen, 2010). These small molecules are classed as IN strand transfer inhibitors, as they specifically target the second catalytic step of the integration process (
Espeseth et al, 2000). Although inhibition of 3′-processing can be observed in the presence of elevated concentrations of these molecules (
Metifiot et al, 2010), it is unlikely to contribute to their antiviral activity. Strand transfer inhibitor scaffolds comprise two functional moieties, a heterocyclic core displaying a triad of metal chelating atoms (typically three oxygens) and a halo-benzyl side chain, connected via a short, torsionally flexible linker. A high degree of sequence conservation in the active sites of retroviral INs allowed the use of the PFV intasome as a surrogate for its HIV-1 counterpart (
Hare et al, 2010a,
2010b,
2011). In the PFV CI-inhibitor cocrystal structures, the metal chelating triad interacts with the Mg
2+ cations in the IN active site, while the halo-benzyl side chain displaces the base of the reactive deoxyadenosine at the processed 3′-viral DNA end (
Hare et al, 2010a,
2010b,
2011).
Our prior work focused on post-catalytic (CI and STC) or inactivated (TCC lacking 3′-hydroxyls or catalytic metal ions) IN–DNA complexes (
Hare et al, 2010a;
Maertens et al, 2010). Herein, we present crystal structures of the functional UI and TCC in their ground states committed for 3′-processing and strand transfer, respectively. The structures provide unprecedented insight into the positions of the metal ions and chemical reacting groups in the IN active site, highlight a substrate mimicry utilized by strand transfer inhibitors in their mode of binding to IN and explain why these small molecules are ineffective against the 3′-processing reaction and why inhibitors of 3′-processing have been more difficult to develop.