Transcription initiation by most eukaryotic and archaeal multi subunit polymerases requires TFIIB or a TFIIB-related general transcription factor. TFIIB and TFIIB-like proteins (archaeal Tfb and the Pol III factor Brf) operate at multiple key steps during transcription initiation such as Pol binding, preinitiation complex formation, start site selection, promoter opening, and abortive initiation (
1–
4). Because of the fundamental and conserved role of TFIIB-like factors for other Pols, it was surprising that none of the Pol I-specific subunits or general initiation factors were known to share TFIIB homology.
Transcription of the
S. cerevisiae rDNA locus by Pol I requires four general transcription factors: UAS-binding Upstream Activity Factor (UAF), TATA-binding protein (TBP), Core Factor (CF) and the regulatory factor Rrn3 (
5). CF contains subunits Rrn6, 7, and 11 (
6,
7), and interacts with UAF, TBP, Rrn3, and Pol I (
5). CF is analogous to the human Pol I factor SL1, composed of four TBP-associated factors (TAFs), TAF1A, TAF1B, TAF1C, and TAF1D (
8,
9), where TAF1A, B and C are orthologous to yeast CF subunits Rrn6, 7, and 11 (
10). Both CF and SL1 recruit Pol I to its promoter but the function of the CF/SL1 subunits, apart from TBP and Pol binding, is unknown (
5,
11,
12).
We guessed that a Pol I TFIIB-like factor would have diverged considerably from related factors (
13) since Pol I subunits share relatively low protein sequence conservation with their Pol II and Pol III counterparts (
14). Using the homology detection program HHpred, which uses pairwise hidden Markov model profile comparisons that are more sensitive than traditional web-based approaches (
15), we detected high probability matches between the Rrn7 N-terminal 320 residues and the TFIIB family, indicating that Rrn7 is a TFIIB paralog. The probability scores ranged from 95%–98%, including a 100% match with the human SL1 subunit TAF1B (;
Fig S1). No significant homology was detected between Rrn6 or Rrn11 with other Pol or basal factor subunits. Rrn7 and TAF1B share low sequence conservation between TFIIB family members and each other (8–16% identity), whereas the predicted secondary structures are similar (,
Fig S2–S4). Rrn7 and TAF1B form a distinct Pol I specific clade, consistent with their sequence divergence from the other TFIIB-related proteins (
Fig S5A). A plant-specific TFIIB-related protein (Brp1) was described to function in Pol I transcription since it crosslinks to the Pol I promoter in vivo and bound Pol I in vitro (
16). However, Rrn7 and TAF1B match with 100% probability to Mee12, an uncharacterized TFIIB-related plant protein, while Brp1 is more TFIIB-related (
Fig.S5B–D). This is consistent with a conserved Pol I function of Rrn7/Taf1B/Mee12 in all eukaryotes and plant-specific functions for the Brp proteins (
16,
17).
The TFIIB homology domains (BHDs) of Rrn7 and TAF1B contain the same four subdomains found in all TFIIB-like proteins: the zinc ribbon, B-reader, B-linker, and cyclin folds (). For TFIIB, the ribbon domain binds Pol II, the reader and linker domains are situated in the Pol II active site where they are involved in open complex formation and transcription start site selection, and the cyclin domains bind TBP, Pol II and promoter DNA (
2,
3). The Brf proteins also contain a large C-terminal domain (CTD) that is essential for function, and mediates interaction between Pol III and the TFIIIB subunits Bdp1 and TBP (
18,
19). Rrn7 and TAF1B contain a large CTD (
Fig S6,7), but the Rrn7 CTD sequence is yeast-specific and the TAF1B CTD is conserved among metazoans. Both are unrelated to the Brf CTD.
The function of Rrn7 derivatives with deletions and single amino acid substitutions was tested in a yeast strain containing a chromosomal
rrn7 deletion and where the 35S rRNA precursor is transcribed by Pol II from the
GAL7 promoter (
20). Strains lacking Pol I transcription activity are dependent on galactose for survival and cannot grow on glucose media. Disruption of every Rrn7 subdomain was lethal, indicating that all TFIIB- and Brf1-like domains are essential (
Fig S8).
Since TFIIB-like factors directly interact with their respective Pols, we immobilized yeast Pol I via the second largest Pol I subunit (Rpa135) and measured the binding of GST-Rrn7 derivatives (). Full-length Rrn7 and the separate BHD and CTD domains specifically bind immobilized Pol I. This Pol-binding behavior is similar to Brf1, where both its BHD and CTD bind Pol III (
21). The TFIIB ribbon domain binds with high affinity to the Rbp1 dock domain, where it positions the adjacent reader and linker regions in the enzyme active site (
2,
3,
22,
23). As predicted for a TFIIB-like factor, the Rrn7 ribbon bound strongly to Pol I, whereas binding was severely reduced by a mutation in two Zn-coordinating residues ().
The Pol II (Rbp1) and Pol III (Rpc1) dock domains are moderately conserved with Pol I (Rpa190; 34–40% identity); however, their predicted secondary structure is nearly identical (). Replacement of Rpa190 dock residues with Glycine-Serine linkers (Δ1, Δ2; ) gave moderate to severe growth defects whereas 2 out of 3 multiple alanine substitutions gave moderate growth defects (, compare A2,3 and A1,2,3 to A1,2). There was high correlation between alanine substitutions that reduced Rrn7 ribbon-Pol I binding and growth defects (A2,3 and A1,2,3; ). Together, our findings suggest that the Rrn7 ribbon interacts with the Rpa190 dock domain, similar to the TFIIB ribbon-Pol II dock interaction.
TAF1B displays several differences from S. cerevisiae Rrn7, including a very short B-linker and a large insertion within the first cyclin repeat (, blue highlight). However, we note the low sequence similarity between Rrn7 and TAF1B in the linker and at the beginning of the first cyclin repeat, so the boundary between these domains in TAF1B is not certain. Substitution of the entire TAF1B BHD for the analogous domain in Rrn7 complemented yeast growth (), whereas substitution of the TAF1B CTD did not, explaining why full-length TAF1B will not substitute for Rrn7.
To test if any of the TFIIB or Brf1 domains could functionally substitute within Rrn7, we generated Rrn7 derivatives containing segments of TFIIB or Brf1. We found that the TFIIB and Brf1 B-linkers complemented Rrn7 function, (;
Fig S9), and the Rrn7 B-linker could functionally substitute within TFIIB or Brf1 (). The TFIIB linker forms a helix proposed to lie at the junction of single and double stranded DNA in the open complex state (
2,
3). Since B-linkers from the TFIIB-related factors substitute for one another, they may all fulfill this function, despite heterogeneity in length and sequence. Consistent with this proposal, we found that the TFIIB linker mutation L110P, which disrupts TFIIB function (
2), also disrupted linker function in the Rrn7-TFIIB chimeria ().
The TFIIB and Brf1 cyclin and B-reader domains could not functionally complement the analogous Rrn7 domains (
Fig S9). However, the Brf1 and Rrn7 ribbon domains were interchangeable, whereas the TFIIB ribbon was not. (). These results show that Rrn7 is more functionally related to Brf1, consistent with their similar domain architecture, interchangeable ribbon domains, and BHD and CTD-mediated Pol binding.
Rrn7 shares many basic properties with TFIIB and Brf1 despite significant sequence variation. The finding that the Rrn7 and Brf1 ribbon domains are interchangeable suggests that they promote the same functions, such as providing an anchor, so that the reader and linker segments are positioned in the active site to facilitate DNA opening. In contrast, the TBP binding activity of CF seems split between Rrn7 and Rrn6 (
6,
12). It remains unclear whether the Rrn7 cyclin repeats bind TBP and whether they interact with Pol I, or if these functions have been distributed among other CF subunits.
All Pols must overcome similar obstacles to initiate transcription including specific promoter binding, template opening, transcription start site recognition, and the transition to processive elongation. The best understood TFIIB family member is TFIIB, where each subdomain performs a specific and essential role in PIC formation, promoter opening and/or start site selection. Bacterial Pol, which lacks a TFIIB-related factor, uses σ factor for many of the same functions in promoter binding, open complex formation and initiation. Although there is no sequence or significant structural similarity, TFIIB and σ70 show striking topological similarities in the way they bind to their respective Pols in near identical locations (
2,
3). Combined with previous findings, our results point to the conservation of initiation mechanisms among all multisubunit Pols.