This study illustrates that the integrase from phage TP901-1 is a robust and autonomous enzyme that can mediate site-specific integrative recombination between compact attB and attP sites in environments remote from its native host, including human cells.
By using an intramolecular integration assay in
E. coli, we characterized minimal recognition sites for the enzyme that were 50 bp long for
attP and just 31 bp long for
attB. These sites are somewhat smaller than the minimal 56-bp
attP site (
5) and 43-bp
attB site (
4) previously reported. In the previous cases, the minimal
att sites were asymmetrically disposed about the 5-bp core, with most of the sequence upstream of the core, whereas the minimal
att sites found in this study are centered on the 5-bp core. Numerous paired direct and inverted repeats have been identified within
attP (
5,
7) and
attB (
4) that may be involved in the integrase binding and recombination function. Our minimal
attB sequence includes the B2 and B3 repeats (
4), although only a single copy of each repeat is present (Fig. ). Our minimal
attP site includes the P1, R4, R5, and R6 repeats (
5) (Fig. ), although again only one copy of each repeat, not the pair, is present. These reduced sites function as well as full-length sites that are over 300 bp long in both
E. coli (Fig. ) and human cells (Table ). The significance of the repeats is therefore unclear. The second copy present in the full-size
att sites might provide enhancement of recombination that is undetectable with our assay because it is fully saturated.
The minimal
attB31 sequence contains the C17 and A25 bases that have been shown to both reduce in vitro binding of the integrase and reduce recombination in
E. coli (
4). The TP901-1 integrase binds with similar affinity in vitro to
attB and
attP (
4), as is also the case for the
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
C31 integrase (
28), another member of the extended serine recombinase group. By mutating each base in the
attB 5-bp core, it was revealed that only the first two bases are important (
4). This result suggested that there is a 2-bp overlap region between
attB and
attP during recombination, as has been observed for other members of the serine recombinase family (
24).
Our major interest in the TP901-1 integrase is in its potential as a tool for making directed genomic rearrangements in eukaryotic cells. Our in vitro studies with this enzyme reinforce the hypothesis that it possesses good reaction kinetics and stability and works well at 37°C (Fig. ), making it eligible for use in mammalian cells. The small size of the
att sites is consistent with a lack of cofactor requirements, which is also a highly desirable feature because it simplifies use of the enzyme in foreign hosts. The efficient function of the enzyme in vitro and in
E. coli documented here and recently by other workers (
4) and the positive results for human cells reported here suggest that the TP901-1 integrase should function in most cellular environments in a wide range of species. The enzyme joins the other two integrases of the extended serine recombinase family,
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
C31 (
9, 26,
30) and R4 (
19), in this respect. The intramolecular integration activity of the TP901-1 integrase in human cells is within twofold of the activities measured for the
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
C31 (
9) and R4 (
19) integrases in similar assays.
This study demonstrated efficient function of the enzyme in an intramolecular integration reaction in human cells (Table ). This type of reaction is useful for creating chromosome rearrangements, such as deletions. We are currently investigating the ability of the enzyme to carry out such reactions in a mammalian chromosomal context. Another major area of interest is the enzyme's ability to carry out intermolecular integration reactions. The enzyme was designed to carry out such reactions in the normal phage life cycle, mediating integration between an incoming phage genome and the host bacterial genome. Plasmid integration vectors carrying
attP mediate intermolecular integration into the native host
attB site at a frequency of ~20% (
6). This integration reaction is largely unidirectional. A second phage protein, an excisionase encoded by
orf7, is required to bring about an efficient reaction between
attR and
attL, which is required for phage excision (
3). To maximize the integration reaction, the excisionase is simply left out.
It may be possible to use the TP901-1 integrase as an integration tool targeted to inserted wild-type
att sites. We have obtained evidence for integration of a plasmid containing the TP901-1
attB site into a TP901-1
attP site that was placed into the genome of human 293 cells (Stoll and Calos, unpublished results). The TP901-1 integrase is therefore capable of intermolecular integration into chromosomes in the human cell environment. As well, the short length of the
att sites makes feasible the idea of using native chromosomal sequences resembling the
att sites as integration targets. Such pseudo-recombination binding sites exist in the case of the Cre recombinase (
29) and the
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
C31 and R4 integrases (
19,
30). The sequences of the
att sites recognized by the TP901-1 integrase differ from those recognized by these other two integrases and offer additional points of entry into the genome, especially in AT-rich regions. It may be possible to enhance the reactivity of the enzyme with chromosomal sequences and to improve its affinity for particular native sequences by DNA shuffling (
25) of the integrase gene, as has been shown for the
![[var phi]](/corehtml/pmc/pmcents/x03C6.gif)
C31 integrase (
23). If so, the TP901-1 integrase may become an important tool for engineering the genomes of higher living cells.