Insertion sequences (IS) play a preponderant role in shaping prokaryotic genomes. They are ubiquitous and have been identified in high numbers in many bacteria and archaea (
1). Indeed, transposases, the enzymes which catalyse their movement, are by far the most numerous and ubiquitous genes in nature (
2).
We recently described an unusual type of bacterial insertion sequence, the IS
200/IS
605 family, whose members undergo single-strand (ss) transposition. They use obligatory ssDNA intermediates and insert into an ssDNA target (
3–5) such as the lagging strand template of replication forks (
6). The paradigm of this family, IS
608 from
Helicobacter pylori (
7) transposes efficiently in the heterologous host,
Escherichia coli. IS
200/IS
605 family transposases (TnpA) are not related to the well known and best characterized DDE transposases (
8) but are members of the large HUH (histidine–hydrophobic–histidine) endonuclease family that includes viral Rep proteins, conjugative plasmid relaxases and rolling circle replication initiator proteins (
9). All use a catalytic tyrosine residue to attack the target phosphodiester bond creating a covalent 5′-phosphotyrosine enzyme–substrate intermediate (
4,
5,
10).
As for other IS types, these reactions are carried out by a molecular machine, the transpososome or synaptic complex, a key controlling element in transposition (
11) in which the IS ends are assembled into a complex with several transposase protomers. In a number of systems, strand cleavage can only occur once the transpososome is correctly assembled (
12–14), ensuring that adventitious and potentially deleterious DNA cleavages do not occur before conditions for productive transposition are assured.
Our limited knowledge of transpososome architecture and behaviour has been obtained largely from transposable elements which use double-stranded (ds) DNA intermediates and employ DDE transposases (
15–17). These transpososomes undergo a series of orchestrated transformations involving conformational changes leading to the positioning of transposon DNA strands for cleavage, elimination of flanking DNA, target DNA docking and DNA strand transfer. Tn
10 and phage Mu transpososome assembly, for example, is highly ordered and the complexes become increasingly robust and refractory to denaturation as transposition progresses (
18,
19).
Little is known about assembly and behaviour of IS
200/IS
605 family ss transpososomes although structural studies have revealed large conformational changes on DNA binding (
4).
IS
608 TnpA binds subterminal imperfect palindromes, IP
L and IP
R, at the left (LE) and right (RE) ends (A and B) located some distance from the cleavage sites (
4,
20) (B). Cleavage is strand specific occurring in the ‘top’ strand (B). Strand transfer generates a circular ssDNA ‘top’ strand transposon intermediate with abutted RE and LE (the transposon or RE–LE junction) (C). This inserts specifically into an ssDNA target 3′ to a TTAC tetranucleotide (E) (
3,
7) also required for subsequent transposition (
5). Strand transfer also joins the DNA originally flanking the excised strand to generate a donor joint and preserve the target TTAC without DNA gain or loss (
5) (D). The entire transposition cycle has been reconstituted
in vitro with purified TnpA (
3).
TnpA itself is dimeric both in solution and in the X-ray crystal structure (
4). The IP
L and IP
R binding sites recognize IP structural features and are located on one dimer face and the two active sites on the other (
Supplementary Figure S3E) (
4,
20). Both active sites include amino acids from each monomer assembled by juxtaposition of an alpha helix (D) carrying the catalytic tyrosine (Y127) of one monomer and the two histidines of the HUH motif (that coordinate the catalytically essential divalent metal ion) from the other. In complexes with or without bound oligonucleotides (containing only IP
L and IP
R), the active sites are in an inactive configuration (
4) but the ensemble undergoes a large conformational change if 4

nt 5′ to the foot of the IP are included. This permits divalent metal ion binding and places Y127 in an appropriate position for nucleophilic attack (
Supplementary Figure S3E, right) (
20).
IS
608 shows important asymmetries in both its organization and in its transposition mechanism. Cleavage at LE occurs 3′ to a conserved TTAC located 19

nt 5′ from the foot of IP
L whereas at RE, the cleavage site, TCAA, lies 10

nt and 3′ to the foot of IP
R (B). Astonishingly, these cleavage sites (C
L and C
R) are not recognized directly by the enzyme but via base pairing with four bases (guide sequences: G
L and G
R) that are located 5′ to the foot of IP
L and IP
R respectively (
20,
21). Furthermore, while strand cleavage creates 5′ phospho–tyrosine bonds between TnpA and substrates at both ends (
4,
5), this reaction occurs with the 5′-end of LE but with the 5′-end of the DNA flank at RE (B).
The experiments reported here address questions central to the formation, organization and function of the ss transpososome involved in the excision of IS
608 as an ss circular transposition intermediate. We reconstitute a biologically relevant complex including both LE and RE and TnpA and demonstrate that it is catalytically active. We identify DNA sequences within the ends required for transpososome assembly. Furthermore, we show that the guide sequences, G
L and G
R, and the LE and RE cleavage sites, C
L and C
R, together with a network of canonical (‘Watson and Crick’) and unusual non-canonical base interactions (reminiscent of certain catalytic RNA species (reminiscent of certain catalytic RNA species (
22)) are necessary for assembly of a robust synaptic complex stable during gel electrophoresis. The results provide a detailed picture of the way in which the IS
608 transpososome is assembled.