The yeast deletion array and the synthetic genetic array (SGA) approach
Since genomes of a variety of organisms are fully sequenced, researchers have knowledge of all predicted open reading frames (ORFs). These gene models have allowed the creation of genome-wide resources where most genes are deleted or can be systematically perturbed. RNA interference (RNAi) is typically used to knockdown gene function by reducing levels of messenger RNA (mRNA). In
Caenorhabitis elegans,
Drosophila melanogaster, human and mouse, genome-wide RNAi libraries exist to reduce expression of each gene [
28–30].
In budding yeast, more directed approaches have been used to generate strain collections where each full length gene is deleted. Here, researchers take advantage of the high intrinsic rate of recombination to replace each gene with a dominant antibiotic resistant cassette. The first construction of a complete gene-deletion library was reported in budding yeast,
S. cerevisiae, where a kanamycin resistance cassette (KanMX) with flanking regions homologous to each yeast ORF was targeted to replace each coding sequence [
31]. Strategies have been used to create gene-disruption libraries in other organisms like
Schizosaccharomyes pombe (available from BiONEER),
E. coli [
32] and
Cryptococcus sp. [
33].
Characterizing the library of yeast deletion strains revealed that ~20% of all 6000 yeast genes are essential for haploid viability, only about half of which were previously known [
31]. This remarkable result revealed that extensive redundancy exists in the genome that buffers the consequence of single gene deletions. This observation fuelled development of a high-throughput method for genetic manipulation of the yeast deletion library termed the synthetic genetic array (SGA) approach [
34,
35]. Initially, this approach was used to combine a query mutation of choice with each yeast deletion strain to create an output array of double mutants where arrayed colonies could be scored for a growth defect that is more severe than each single gene deletion (so called synthetic lethal or synthetic sick interactions). The SGA method has been described in detail elsewhere [
36]. Key features of the SGA approach are: (1) it is automated by use of robotics to replicate arrays of yeast colonies onto different selection media; (2) since yeast deletion mutants are arrayed, the position of each yeast mutant is known; (3) yeast of
MATα mating type with a marked allele ‘A’ are mated to the array of
MATa yeast deletion mutants to produce diploid yeast strains which are then sporulated and (4)
MATa meiotic progeny with the marked query allele ‘A’ combined with each arrayed deletion mutant are preferentially selected because they contain a
STE2 promoter driving expression of an auxotrophic marker gene that is only expressed in
MATa cells, allowing them to grow on media lacking the appropriate amino acid [
34,
35]. The net result of these features is that arrays of yeast mutants harbouring a query mutation (such as a deletion allele) or reporter gene of interest (see below) can be rapidly produced through a series of simple pinning steps.
Combining automated genetics with reporter genes
To combat many of the problems associated with classical forward genetic screens (described above), we recently described methods for carrying out systematic reverse genetic screens to identify regulators of a promoter of interest using array-based functional genomic tools and resources in yeast [
37]. The method involves use of the SGA procedure to introduce a reporter gene of interest from an otherwise wild-type query strain into the ordered array of ~4500 known yeast deletion mutants. We first applied the method with two types of reporter genes, which are based on a colourimetric assay or auxotrophy. In one case, the activity of a promoter fused to
lacZ is assessed in each yeast deletion mutant by replica pinning the entire array onto medium containing the substrate X-gal (see above). As noted earlier, on this medium, deletion of genes encoding repressors of the promoter driving
lacZ expression will result in higher levels of
lacZ transcription and thus greater β-galactosidase activity leading to very blue colonies. If the deleted gene encodes an activator of the promoter,
lacZ transcription will be reduced leading to a white colony colour.
A second reporter gene is based on
HIS3 auxotrophy. Cells that cannot produce histidine because they lack the
HIS3 gene must be grown on media supplemented with histidine. In this case, a reporter gene harbouring a promoter fused to
HIS3 is introduced into the yeast deletion array using the SGA methodology and deletion of genes that allow growth on medium lacking histidine are scored as repressors of that promoter [
37]. This type of screen is useful only if the promoter driving
HIS3 is weak enough so that a growth defect is seen when cells are grown on medium lacking histidine or a mutation is made so that the promoter driving
HIS3 is inactivated.
The
HIS3 reporter gene approach was used to screen for new regulators of the SCB element, which is bound by the transcription factor SBF (see above). In this case, an SCB-
HIS3 reporter gene was introduced into a strain with a
CLN3 deletion [
38]. The absence of the cyclin
CLN3 prevents activation of SCB-dependent transcription, thus cells cannot grow in the absence of histidine because of the failure of the SCB element to drive transcription of the
HIS3 gene. The SGA methodology was used to combine the reporter gene and marked
CLN3 deletion allele with each yeast deletion mutant. The resulting array of yeast mutants was screened on media lacking histidine and a new repressor of G1 transcription was discovered. Deletion of
WHI5 resulted in growth of colonies because the absence of Whi5 relieved repression of the SCB element (caused by the
CLN3 mutation) and allowed
HIS3 transcription [
38]. Follow up experiments revealed that Whi5 association with SBF represses transcription but upon CDK phosphorylation, Whi5 dissociates from SBF to allow expression of late G1 genes [
38]. This study defined a pathway in yeast analogous to the Rb-E2F pathway in mammalian cells, which is often targeted in many types of tumours.
Although these types of reporter screens have proven useful, they are generally not quantifiable making them difficult to employ in large-scale studies, which led us to develop our next generation R-SGA screening platform based on the use of fluorescent proteins. The recent engineering of fluorescent proteins in a variety of different colours across non-overlapping spectral classes has made multi-colour cell biological experiments feasible [
39,
40]. Additionally, fluorescent proteins are easy to detect with the correct optics and fluorescent signals can be rapidly quantified, making them useful markers of a variety of cellular events.
The fluorescent protein-based R-SGA system [
41] includes a query strain that harbours any promoter of interest fused to GFP on a plasmid as well as an integrated control promoter fused to RFP (). The query strain is crossed to the collection of ~4500 viable haploid yeast deletion mutants using the SGA platform, resulting in an output array in which each deletion mutant contains both reporter genes. By constructing such an output colony array, the effect of each yeast deletion mutant on reporter gene activity is easily assessed by scanning both GFP and RFP fluorescence intensities directly from colonies arrayed on agar plates using a scanning fluorimager. After quantifying these data, the GFP intensity captured from each colony can be standardized to the RFP signal from that same colony to identify deletion mutants causing differential GFP expression. It is expected that deletion of a putative activator will result in a decreased GFP:RFP ratio while deletion of a putative repressor will result in an increased GFP:RFP ratio. This methodology allows a quantitative and unbiased survey of the genome to identify both direct regulators and upstream signals and pathways that impinge upon a promoter of interest.
The R-SGA approach was applied to study transcription of histone gene expression [
42], a group of genes whose transcripts are tightly cell-cycle-regulated during S-phase [
43]. We fused the promoter of the histone H2A gene (
HTA1) to GFP and carried out a R-SGA screen to identify
HTA1-promoter regulators [
42]. This screen identified the known
trans-acting regulatory proteins (Hir1, Hir2, Hir3 and Hpc3 or HIR) that repress
HTA1 transcription by acting through a negative (NEG) regulatory site in the promoter [
44–49]. The screen also led to the discovery of another histone chaperone protein, Rtt106, which was required for repression of
HTA1 transcription. Follow-up studies revealed that Rtt106 functions in a pathway with the known repressors (Asf1 and HIR) to limit expression of the HIR-regulated histone genes to S-phase of the cell cycle [
42]. Repression by Rtt106 appears to be controlled through the NEG sequence in HIR-regulated histone gene promoters since
HIR1 is required for Rtt106 recruitment to a region of the promoter that encompasses the NEG site and previous work has shown that the NEG sequence is required for Hir1-mediated cell cycle regulation of histone genes [
48].
Other large-scale approaches were carried out to further characterize the molecular defect caused by deletion of histone chaperones on gene promoters and expression. Genome-wide nucleosome occupancy experiments [
6] carried out in
rtt106Δ and
hir1Δ mutants revealed that histone promoters are generally nucleosome-free (except the
HTA2-
HTB2 promoter which is regulated independent of the HIR complex) [
42]. Other promoters were also identified in both of these mutants that appear to be depleted of nucleosomes indicating that HIR and Rtt106 dictate regions of repressive chromatin at other promoters in addition to those that drive expression of histone genes. A number of promoters unique to each mutant are also nucleosome-depleted indicating that other pathways likely exist where HIR and Rtt106 assemble nucleosomes at promoters independently of each other. In future work, genome-wide localization studies using ChIP-seq on TAP-tagged versions of Rtt106 and Hir1 will be important for determining the promoter regions bound by these proteins. These data coupled with nucleosome occupancy data already generated and gene expression profiling should elucidate the collaborative and independent roles these proteins are playing at promoters throughout the genome.
The functional genomics screen carried out to identify specific regulators of the
HTA1 promoter, revealed that deletion of
YTA7 caused decreased GFP expression compared to the RFP control gene [
42], although previous work suggested a role for Yta7 is proper repression [
50]. Yta7 is a bromodomain-containing protein that acts as a barrier protein on chromatin at the
HMR locus in yeast [
51]. Detailed analysis of Yta7 function at the
HTA1 promoter revealed that it localizes to both the NEG-containing region of the promoter as well as the coding region. When
YTA7 is deleted, Rtt106-TAP spreads laterally across the promoter region and into the
HTA1 ORF, where it is normally not present, revealing a role for Yta7 in properly localizing Rtt106 to the histone promoter. It is likely that the lateral spread of Rtt106 into the coding region is propagating a repressive chromatin structure or is interfering with RNA polymerase II elongation which explains the
HTA1 transcriptional defect observed in the
YTA7 deletion strain.
The results described above reinforce the need for carrying out quantitative and unbiased reporter screens to identify new avenues of research leading to discoveries on how genes are regulated at the level of transcription. By screening an array of almost all viable yeast deletion mutants for an effect on a GFP reporter gene relative to an RFP control gene, virtually all known regulators of histone HTA1 transcription were identified along with new regulators that could be placed in a common pathway using traditional genetic and biochemical approaches. Carefully selected promoters can be screened using this approach to understand transcriptional regulatory networks that underlie specific cellular processes. For example, in our work, we are screening a panel of ~30 cell-cycle-regulated promoters that are controlled by known cell cycle transcription factor pathways in an effort to describe a detailed interplay of direct and upstream regulators that control cell cycle transcription (P.K. and B.J.A., unpublished data).