Trypanosomatid parasites of the genera
Trypanosoma and
Leishmania cause devastating diseases in humans and livestock. During infection, these organisms essentially depend on gene expression mechanisms not found in their hosts. In trypanosomatids, nuclear protein-coding genes are transcribed polycistronically and individual mRNAs are excised from large precursors by
trans splicing and polyadenylation (reviewed in reference
22).
trans splicing in which the capped 5′-terminal spliced leader (SL), also known as the mini-exon, is cleaved off the SL RNA and fused to the 5′ end of a pre-mRNA is an essential maturation step for all trypanosomatid mRNAs. Since the SL RNA is consumed in the process, trypanosomatid organisms crucially depend on a high level of SL RNA synthesis, making it a promising target for parasite-specific inhibition. To accommodate the high synthesis rate, a trypanosome cell harbors approximately 200 SL RNA gene copies, which are transcribed by RNA polymerase II in a monocistronic fashion (
3). The anatomy of the SL RNA gene promoter has been meticulously investigated in the three trypanosomatid species
Trypanosoma brucei (
7),
Leptomonas seymouri (
8,
13), and
Leishmania tarentolae (
38); it is conserved and consists of a bipartite upstream sequence element (USE) and a putative initiator element (
23) at the transcription initiation site.
Trypanosomatids diverged from the main eukaryotic lineage very early in evolution (
11,
34). As a consequence, protein identification in trypanosomatid genome databases is only successful for the most conserved proteins, such as the TATA-binding protein (TBP)-related factor 4 (TRF4). Chromatin immunoprecipitation revealed that TRF4 is associated with SL RNA gene sequences, and an RNA interference analysis indicated that TRF4 functions in SL RNA gene transcription (
30). A second factor associated with SL RNA gene transcription was biochemically purified in
L. seymouri (
2). The factor was termed PBP-1; it consists of three subunits with apparent molecular masses of 57, 46, and 36 kDa, and it binds specifically to the SL RNA gene USE (
25). While p36 was not characterized, p46 appeared to be a parasite-specific protein (
2). Conversely, p57 was identified as a divergent orthologue of SNAP50, a subunit of the human small nuclear RNA (snRNA)-activating protein complex (SNAP
c [reference
9]) also described as a proximal sequence element-binding transcription factor (
37), suggesting that PBP-1 is a SNAP
c-like factor (
2).
Human SNAP
c is an essential factor for RNA polymerase II- or III-mediated transcription of snRNA genes (reviewed in reference
10). The protein complex binds to the proximal sequence element of snRNA gene promoters and consists of five subunits, three of which are essential for transcriptional activation, namely SNAP190, SNAP50, and SNAP43. The recent characterization of
Drosophila melanogaster SNAP
c revealed orthologues to these three proteins but no additional subunits (
21). The sequence conservation between human and insect SNAPs is limited, and only functionally important domains exhibit substantial similarity (
21). In addition to SNAP
c, human snRNA gene transcription essentially depends on the basal transcription factors TBP, transcription factor IIA (TFIIA), TFIIB, TFIIF, and TFIIE (
15). Orthologues of these factors may be involved in trypanosome SL RNA gene transcription as well because the 139-nucleotide-long SL RNA belongs to the class of snRNAs. However, except for the TBP homologue TRF4, they have not been identified in trypanosomatid genome databases which are virtually complete for
T. brucei and
Leishmania major.
We have recently cloned and epitope-tagged TbSNAP50, the
T. brucei orthologue of human SNAP50 and
L. seymouri p57, and confirmed that this protein binds to the SL RNA gene USE of
T. brucei (
32). Here, we have created a novel tag combination for tandem affinity purification (TAP) termed the PTP tag. By employing the PTP tagging and purification method initially with TRF4 and subsequently with other subunits, we isolated a stable multisubunit complex which directs SL RNA gene transcription in a cell extract. The complex consists of TRF4, three SNAP
c subunits, the
T. brucei orthologue of the small TFIIA subunit, and a sixth protein which appears to be an extremely divergent orthologue of the large TFIIA subunit.