Combining all the results, we reconstructed a new transcriptional regulation model for the yeast cell cycle (Figure ). There are three general features of this combinatorial control that we would like to point out: waiting-activating systems; joint-phase combinations; and joint-process combinations. A waiting-activating system is an apparatus that waits for some signal in a repressed state and then activates transcription. Several of the transcription factors we have studied seem to bind to their targets in a repressed state before any signal. If a signal occurs, they activate transcription. Examples are: {Fkh2, Mcm1, Ndd1}, which is repressive before the signal generated by CLB kinase activity [
20]; {Hir1/Hir2, Swi/Snf} at histones, which is likewise repressive [
46] until the beginning of DNA synthesis; {Ste12, Dig1} [
35], which is bound to promoters in an inhibited state even in the absence of any signal for mating or pseudohyphal growth; and the SBF and MBF factors, which bind to their target genes early in G1, but which only induce transcription when the complex of cyclin Cln3 and the protein kinase Cdc28 is activated in late G1 [
50]. Wyrick and Young [
51] have also suggested that the pre-binding of an inhibited activator may be a general feature of activators. The mechanisms of repression and activation are probably different in these various cases, but the objective is the same, to wait for a signal and then activate transcription.
A second feature is the existence of joint-phase combinations. By this we mean that some gene promoters are bound by one regulator that works primarily in the previous cell-cycle phase, and also by a second regulator that works primarily in the next cell-cycle phase. Examples are the combinations {SBF, Fkh2} for S-phase regulation, {Fkh2, Mcm1, Ndd1} for G2/M phase regulation, and {Mcm1, SBF} for M/G1 regulation. SBF is largely a G1-phase regulator, and Fkh2 is largely a G2-phase regulator. Yet there is a distinct group of genes expressed in S-phase that depends on the combination of SBF and Fkh2. Similarly, Mcm1 is primarily an M-phase regulator, but there is a large group of genes in G2/M that depends on {Fkh2, Mcm1, Ndd1}. Finally, the M-phase regulator Mcm1 combines with the G1-phase regulator SBF to regulate some genes in M/G1. Although some of these joint-phase combinations had been pointed out previously, we have found new combinations and many new examples. We can now see that the number of cell-cycle genes regulated by a combination of transcription factors may be as large or larger than the number of genes regulated by a single factor.
A critical issue with these joint-phase combinations is whether the two regulators work independently or cooperatively. That is, for a gene that is bound by SBF and Fkh2, is the gene turned on by SBF, and also independently turned on by Fkh2? Or does gene activation require both factors simultaneously? For targets of {Fkh2, Mcm1, Ndd1}, it appears that activation is cooperative, not independent [
20]. Furthermore, in the case of most joint-phase S genes and M/G1 genes, it appears that the peak of gene expression is sharp rather than broad [
1] (that is, expression occurs only when both factors are simultaneously active, not over the whole time that either one or the other of the factors is active) again suggesting cooperativity rather than independence. Although a physical interaction between two factors is often the basis of cooperativity, other mechanisms might also play a part.
When these transcription factor combinations are connected, the resulting chain suggests that regulation is circularly relayed from an earlier TF to a later TF through their combination. Namely, it is relayed from Swi4 (SBF) to Fkh2 via {SBF, Fkh2}, Fkh2 to Mcm1 via {Fkh2, Mcm1, Ndd1)}, Mcm1 to Swi4 via {Mcm1, Swi4}, and so forth (Figure ). This feature is complementary to the finding of Simon
et al. [
14]. They found that transcription activators that function during one stage of the cell cycle regulate transcription activators that function during the next stage. Whereas their finding is primarily focused on regulation between TFs, our chain connected by joint-phase combinations shows that the serial regulation of target genes is relayed through TF-motif combinations.
The apparent ability of cell-cycle factors such as SBF and Fkh2 to function cooperatively has some interesting consequences. It means that two factors can generate at least three peaks of expression: the SBF-only peak, the SBF plus Fkh2 peak and the Fkh2-only peak. But one can also imagine that some promoters might require both factors for gene expression, but have stronger motifs for one factor than the other. Thus a gene with a strong SBF motif and a weak Fkh2 motif might be expressed only in late S (when sufficient Fkh2 has accumulated to bind even the weak motif), while a gene with a weak SBF motif and a strong Fkh2 motif might be expressed only in early S (because later, SBF abundance might be too low to interact with the weak motif). Thus by using cooperativity, the cell could generate a continuum of peaks of expression over time using a small number of factors, and a large number of varied motifs, exactly as observed. Molecular experiments will be required to investigate this issue.
Finally, we note the existence of joint-process combinations, by which we mean combinations of TFs that allow genes to respond to two (or more) different transcriptional programs. Although cells undoubtedly have many such combinations, in our dataset the main examples involve Ste12, a regulator of the mating or pseudohyphal growth pathways. For instance, {SBF/MBF, Ste12} in G1 probably controls genes needed for G1 phase, but also independently needed for mating. Similarly {Swi5, Ste12, Dig1} in M/G1 may control genes needed for the M/G1 transition, but also important for either mating or for pseudohyphal growth.
In summary, we have extended the understanding of the yeast cell cycle by integrating ChIP-microarray analysis with expression analysis and motif-combinatorial analysis. Many of our findings from the integrated analysis confirm the results of previous analysis, hence validating our approach. However, we believe that the success of the non-integrated approaches was possible in part because S. cerevisiae has a small genome, its genes have very small regulatory regions, and the datasets are unusually good. As this type of genome-wide analysis moves to higher eukaryotes with larger genomes, we believe that non-integrated approaches will not have sufficient power to provide reliable results, whereas this integrated approach has overcome the limitations inherent in each individual approach. The added power of our integrated approach did allow us to find several interesting novel combinations of motifs and TFs, in particular those new joint-phase combinations. These new predictions lead directly to new hypothesis for new experiments. The computational integration of multiple approaches or datasets will be of increasing importance as more kinds of genomic resources, such as genome-wide protein-protein interaction data and comparative genomics data, become available for more organisms. Indeed, for higher eukaryotes, where gene networks are more complex and regulatory regions are larger, we believe that integration of datasets will be absolutely essential.