FAIRE Positively Selects for Nucleosome-Depleted Genomic Regions
Following phenol-chloroform extraction of formaldehyde-crosslinked yeast chromatin, non-coding regions of the genome are preferentially segregated into the aqueous phase, a procedure we now call FAIRE [
32]. It had been previously hypothesized that this segregation occurred because covalently crosslinked protein–DNA complexes were retained at the interphase of the organic and aqueous solvents, whereas DNA that was not crosslinked (or trapped by crosslinks) escaped into the aqueous phase [
32,
33]. The enrichment of regulatory regions in the aqueous phase was therefore interpreted to indicate relatively inefficient crosslinking between proteins and DNA at these regions. It had been further hypothesized that FAIRE reflected heterogeneity in the distribution of nucleosomes, or differential crosslinking of modified histone tail lysines to the histone octamer, neighboring octamers, or DNA [
32].
At the time FAIRE was first observed, these hypotheses were supported primarily by three observations. First, histones are by far the most abundant and readily crosslinkable protein component of chromatin and thus were likely to dominate the crosslinking profile [
33–
35]. Second, it was known that nucleosomes are disrupted, and in some cases lost all together, at promoters upon gene activation [
10,
11]. Third, in concordance with nucleosome loss at regulatory regions, promoters upstream of highly transcribed genes were more efficiently segregated to the aqueous phase in FAIRE than promoters upstream of poorly transcribed genes [
32].
Before using FAIRE to investigate chromatin throughout the cell cycle, we sought to establish a firm relationship among FAIRE, nucleosome occupancy, and histone modification status. We performed five independent FAIRE experiments on the wild-type yeast strain BY4741. Each sample was comparatively hybridized to a DNA microarray against reference genomic DNA from BY4741. Data from these biological replicates were then used in a comparison of FAIRE to published histone H3 and H4-Myc chromatin immunoprecipitation (ChIP)-chip data [
20], which were derived from samples amplified and labeled in the same manner as the FAIRE samples (
Material and Methods). The correlation between FAIRE and histone ChIP-chip data is strongly negative (A–D). Mitochondrial DNA, which is known to be nucleosome free, was the most highly enriched class of DNA in FAIRE and the least enriched in the histone ChIP-chips. Nearly all elements depleted in the H3 and H4-Myc chips were enriched in FAIRE, and nearly all elements enriched in the histone ChIPs were depleted in FAIRE (E). Therefore, FAIRE is a highly reproducible procedure that enriches for relatively nucleosome-free regions of the genome.
FAIRE Resolves Nucleosome-Free Regions but Not Single Nucleosomes Using Higher-Resolution Microarrays
We tested FAIRE's ability to detect individual features related to chromatin structure, namely deoxyribonuclease I (DNase I) hypersensitive sites [
25], nucleosome-free regions (NFRs) [
23], and the positions of individual nucleosomes [
23]. We hybridized FAIRE samples to microarrays covering 482 kilobases (kb) at 20-bp resolution [
23]. First, we compared the FAIRE enrichment to the distribution of DNAse I hypersensitive sites across 45 kb of Chromosome III. Of the 34 known DNase I hypersensitive sites specific to this region [
25], 28 coincided with strong peaks of FAIRE enrichment (A and B). To test the relationship between NFRs and FAIRE enrichment, probes were mapped relative to the start codon of every gene represented, and FAIRE data were averaged as a function of distance from translation start sites. We found that FAIRE specifically enriches the NFRs immediately upstream of start codons (C). However, in contrast to the micrococcal nuclease-based method employed to characterize the NFRs [
23], FAIRE appears unable to resolve the individual, regularly spaced nucleosomes surrounding the NFR (C). This resolution difference is likely due to the mechanical shearing of chromatin by sonication in FAIRE (~300 to 1,000 bp on average) versus digestion to mononucleosomes in the MNase assay.
FAIRE Is Unaffected by Mutations in Genes Required for Specific Histone Modifications
The hypothesis that FAIRE may reflect differences in histone modification status rather than nucleosome occupancy was based on the observation that formaldehyde forms covalent crosslinks between DNA and chromatin proteins primarily through lysine residues [
36]. Since many lysines in each nucleosome are capable of being modified, and modifications alter the reactivity of the lysine, it is plausible that differential histone modification could create a diversity of formaldehyde reactivity across the chromatin landscape. Each region's unique formaldehyde reactivity, dependent on the combination of histone modifications, would then be captured by FAIRE.
To test this hypothesis, ten strains harboring deletions for histone modification enzymes and four histone tail deletion strains were subjected to FAIRE: H2AΔ1–20, H2BΔ1–32, H3Δ1–30, H4Δ1–27, Δgcn5, Δhat1, Δhat2, Δset1, Δset2, Δset3, Δset4, Δset5, Δset6, and a Δset2Δrad6 double knockout strain (). FAIRE was performed on each of the 14 mutant strains in duplicate, whereas wild-type strain BY4741 is represented by five biological replicates and wild-type strain W303 by a single experiment (). Knockouts of the principal histone methyl- and acetyltransferases and histone deacetylases were of interest because acetylation, dimethylation, and trimethylation are predicted to abolish formaldehyde's reactivity with lysine. The data reveal that the histone modifications mediated by each of these enzymes are not required for fractionation by FAIRE, since each of the 14 mutant strains has a FAIRE profile very similar to the parental wild-type strain BY4741 (). Furthermore, chromatin from a Δrpd3Δhda1Δhos1Δhos2Δhos3 quintuple knockout strain also showed a normal FAIRE fractionation pattern (unpublished data).
| Table 1Saccharomyces cerevisiae Strains Used in This Study |
The amino-terminal tails of histones H2A, H2B, H3, and H4 are the substrate for most known post-translational modifications. If histone modification status were the primary biological phenomena measured in FAIRE, a dramatic change in the fractionation would be expected in strains harboring histone tail deletions. However, H4Δ1–27, which is least correlated with rest of the yeast strains tested, still has a very high correlation coefficient of 0.79 to all other samples.
We conclude that the fractionation between coding and intergenic regions was essentially unaltered in all strains tested (). The fractionation pattern of chromatin mediated by FAIRE persists in every histone modification mutant we have tested, lending support to the hypothesis that nucleosome occupancy, and not histone modification status per se, is the primary determinant of FAIRE.
The Global Pattern of Nucleosome Occupancy Is Maintained throughout the Cell Cycle
Because previous genome-scale experiments measuring nucleosome occupancy were performed on asynchronous cell cultures, it remained formally possible that variation in nucleosome occupancy was restricted to a single cell-cycle phase or a subset of the phases. We arrested cells in late G
1 using the mating pheromone α-factor, released the cells into fresh YPD, and collected synchronized cells at seven time points spanning a single cell cycle. To verify synchronized growth and determine cell cycle phases for each time point, we monitored bud formation by light microscopy and DNA content with DAPI staining (A and B). The collected time-point samples were then subjected to FAIRE, followed by microarray detection (
Material and Methods). Five independent time-course experiments were performed, and thereby each of the seven time points is represented by at least three (and up to five) high-quality data points.
The distribution of FAIRE enrichment values for intergenic regions was compared to that of ORFs throughout the cell cycle. FAIRE was able to detect the previously observed heterogeneous nucleosome occupancy genome-wide at every time point assayed throughout the cell cycle (C). Because all time points showed a similar global pattern of FAIRE enrichment, we conclude that the overall state of the nucleosome occupancy landscape is maintained throughout the cell cycle.
Release from Mating Pheromone Induces Increased Nucleosome Occupancy at FIG1, FIG2, and FIG3 Promoters
To test the ability of our assay to detect nucleosome occupancy changes at promoters, we analyzed nucleosome occupancy at the promoters of three genes known to respond to mating pheromone.
FIG1, FIG2, and
KAR5/FIG3 are important for the yeast mating response and are induced and highly expressed in the presence of mating pheromone [
37]. Activation of
FIG1 by mating pheromone results in nucleosome occupancy loss at its promoter [
38]. The FAIRE data provide evidence for the converse effect by showing that removal of the mating pheromone results in less-efficient FAIRE enrichment of the
FIG1 promoter, indicating a nucleosome occupancy gain that is sustained well after pheromone release (A). This gain of nucleosome occupancy, which correlates with independently obtained expression data, is also characteristic of the promoters of
FIG2 and
KAR5/FIG3 (A and B).
To test whether FAIRE was able to identify other genes responsive to mating pheromone, we sorted the unidirectional promoters by the difference in FAIRE enrichment before and after mating pheromone release. The top promoters (those with a
z-score difference greater than 1) demonstrate a significant increase in nucleosome occupancy (decreased FAIRE signal) upon release from mating pheromone that correlates to lower gene expression levels [
30] (C). Additionally, the identified promoters were more likely to be bound by the transcription factors Ste12, Dig1, and Mcm1, all of which are involved in mating-pheromone response (C) [
39]. Finally, the list of putative pheromone-responsive promoters is also enriched for independently defined “mating-specific genes” [
39]. Of the 45 top promoters, 13 (28.9%) are bound by transcription factors in a mating-specific fashion, versus 47/2,442 (1.9%), of other unidirectional promoters (chi-square
p-value = 1.6 × 10
−31) [
39,
40] (C). Therefore, FAIRE-based measurements of nucleosome occupancy at gene promoters during response to a stimulus or environmental change can be used as a predictor of gene function.
Nucleosome Occupancy at Cell Cycle–Regulated Promoters Varies with the Cell Cycle
Based on the general observation that nucleosome occupancy at a promoter is inversely proportional to the transcription rate of its downstream gene [
20], we hypothesized that nucleosome occupancy at the promoters of cell cycle–regulated genes would be reduced specifically at the cell cycle phase in which that gene exhibited peak expression. To test our hypothesis, we used data from a previous study that found approximately 800 genes to be cell cycle regulated [
30]. We focused primarily on genes with unidirectional (single) promoters, which constitute close to half of the 800. The cell cycle–regulated genes were previously grouped into five categories with somewhat arbitrary dividing lines: G
1, S, S/G
2, G
2/M, and M/G
1 (these categories are used in the online supplement of [
30]). We used these classifications to assign each promoter to the same cell cycle group as its downstream gene. For our initial analysis, we focused on changes in nucleosome occupancy within each promoter as the cells progressed through the cell cycle. To do this, we centered the data for each promoter by the median of the
z-scores recorded at all seven time points, and used these centered
z-score values for downstream analyses (
Material and Methods).
We found a positive correlation between nucleosome depletion at cell cycle–regulated gene promoters and the periodic expression of the cell cycle–regulated genes. In particular, the G
1, G
2/M, and M/G
1 promoters demonstrated periodicity according to the cell cycle (A–C). The bidirectional promoters of cell cycle–regulated genes, which represent a completely independent sample, showed similar results despite the possible competing influence from a second gene (
Figure S1).
Nucleosome Occupancy at S and S/G2 Promoters Does Not Correlate with the Cell Cycle
The nucleosome occupancies of S and S/G
2 promoters did not correlate with the timing of the cell cycle (D and E). The distinct behavior of the S and S/G
2 promoters may reflect properties of the deposition of new nucleosomes during DNA replication (Discussion). It is noteworthy that S/G
2 genes were originally distinguished from “S” genes only by their slower decline in relative mRNA levels after this common peak [
30].
The Nucleosome Occupancy of Most, but Not All, G1, G2/M, and M/G1 Promoters Fluctuate Strongly with Cell Cycle Phase
We have presented evidence that as a group, promoters of cell cycle–regulated genes expressed in G
1, G
2/M, and M/G
1 exhibit changes in nucleosome occupancy that correspond to cell cycle regulation (F). Inspection of the individual promoters of the cell cycle–regulated genes within each of these groups revealed that most, but not all, promoters demonstrated such fluctuations (G
1, 63%; G
2/M, 61%; and M/G
1, 79%) (A–D,
Table S1,
Figure S2). Promoters that experienced cell cycle–related fluctuations in nucleosome occupancy (hereafter referred to as “cycling”) were more likely to be bound by known cell cycle transcription factors than those that did not experience nucleosome fluctuations (A and B). SBF (comprised of Swi6 and Swi4) and MBF (comprised of Swi6 and Mbp1) are heterodimeric transcription factor complexes important for the expression of G
1-specific genes [
41]. Of the 135 G
1 cycling promoters, 54 are bound by SBF or MBF, whereas only 13 of the 78 that did not exhibit fluctuations were bound by either (chi-square
p-value = 4.1 × 10
−4) [
40,
42]. In addition, cycling G
2/M gene promoters were more likely to be bound by Mcm1, Fkh2, and Ndd1, which are known to be important for the expression of G
2/M genes [
41]. Of 85 G
2/M cycling promoters with ChIP data, 16 bind all three transcription factors (at
p < 0.01), whereas none of the 55 non-cycling promoters with ChIP data bind all three (chi-square
p-value = 6.3 × 10
−4) (A and B) [
40]. Furthermore, the previously defined “cycling score” [
30] is higher for genes at which the nucleosome occupancy of the promoter also cycles (unpublished data). Therefore, genes likely to be regulated at the level of transcription can be identified by the characteristic nucleosome occupancy profiles of their promoters. We also note that the promoters of major cell cycle regulators themselves (
CLB1, CLB2, CLB5, CLB6, CLN1, CLN2, CLN3, CDC20, PCL2, PCL9, SIC1, and
SWE1) exhibited cell cycle–dependent changes in nucleosome occupancy (
Figure S3).
G2/M Promoters Are Relatively Depleted of Nucleosomes throughout the Cell Cycle
By examining general nucleosome occupancy within individual cell cycle groups, we found that the unidirectional promoters of G
2/M genes tend to be more depleted of nucleosomes at all time points throughout the cell cycle relative to other unidirectional promoters of cell cycle–regulated genes and all other unidirectional promoters, irrespective of periodic transcription or nucleosome occupancy changes (). This difference in the baseline nucleosome occupancy may indicate a specialized promoter organization required for the expression of genes during mitotic chromosome condensation or during mitosis [
43] (see
Discussion).
Among the Promoters of Cell Cycle–Regulated Genes, Those That Respond to CLN3 and CLB2 Induction Display Particularly Large Changes in Nucleosome Occupancy
Cell cycle progression is tightly regulated in yeast, and the core elements of this regulation are highly conserved throughout eukaryotic lineages. The cell cycle is regulated in part by various cyclins that interact with a cyclin-dependent kinase (CDK) to regulate its activity. In
S. cerevisiae, CLN3 and
CLB2 encode cyclins that activate Cdc28, the main CDK, to promote cell cycle transitions. Cln3 is involved in the G
1 to S transition, and Clb2 is involved in the G2 to mitosis transition. More than half of the cell cycle–regulated genes respond to artificial induction of either
GAL-CLN3 or
GAL-CLB2, indicating that
CLN3 and
CLB2 are major controllers of cell cycle–regulated gene expression [
30]. As expected, cell cycle–regulated genes induced by
GAL-CLN3 expression were generally G
1 genes, whereas those responding to
GAL-CLB2 induction were generally G
2/M genes.
We examined the nucleosome occupancy of promoters of cell cycle–regulated genes induced by
GAL-CLN3 [
30]. These promoters were nucleosome depleted during G
1 time points (18 min and 90 min) and most nucleosomal at G
2/M (63 min), whereas the promoters of genes repressed by
GAL-CLN3 expression [
30] were most nucleosomal at G
1 and depleted of nucleosomes at G
2/M (A). The inverse relationships were observed for the promoters of genes affected by
GAL-CLB2 expression (B). Induced promoters were nucleosome depleted during G
2/M (63 min) and most nucleosomal at G
1 time points (18 min and 90 min), whereas the promoters of genes repressed by
GAL-CLN3 expression [
30] were most nucleosomal at G
2/M and depleted of nucleosomes at G
1.
We next compared the nucleosome occupancy changes at the promoters of cell cycle–regulated genes that were positively induced by GAL-CLN3 or GAL-CLB2 to cell cycle–regulated genes that were not strongly induced by GAL-CLN3 or GAL-CLB2. Comparisons were made within the respective cell cycle phase groups (G1 for GAL-CLN3 and G2/M for GAL-CLB2). We defined “induced” genes as those increasing more than 2-fold and genes “not induced” as those changing less than 1.5-fold in either direction. The promoters of G1 genes induced by GAL-CLN3 were more nucleosome depleted during G1 and also showed greater amplitude in periodicity through the cell cycle compared to the promoters of G1 genes not induced by GAL-CLN3 (C). We used a two-sample Student's t-test to find that the means of the groups were significantly different during G1 time points (p = 2.9 × 10−4 for time 18, p = 0.045 for time 90). Examination of the G2/M gene promoters at the 63-min time point yielded analogous results (p = 0.020 for time 63) (D).
A plausible explanation for the greater nucleosome fluctuations observed for
CLN3- or
CLB2-responsive genes was illuminated by re-examination of the cell cycle expression data. The RNA levels of genes that respond to
CLN3 or
CLB2 induction experience greater changes in relative abundance throughout the natural cell cycle than those that do not respond to
CLN3 or
CLB2 induction (
Figure S4). The greater nucleosome fluctuations observed for promoters of
CLN3- or
CLB2-responsive genes suggest more transcription initiation events at the time of peak RNA levels, rather than regulation by a post-transcriptional mechanism. Further support for this hypothesis was obtained by analysis of the data for all unidirectional promoters of cell cycle–regulated genes, which revealed a positive correlation between the amplitude of nucleosome occupancy and the amplitude of RNA level throughout the time course (
Figure S4). The greater nucleosome depletion at these promoters would then be explained by the known positive relationship between transcriptional initiation and nucleosome loss [
20,
32].
Cell Cycle–Correlated Nucleosome Depletion at Promoters of Genes Not Annotated as Cell Cycle Regulated
We probed the dataset for promoters of genes that were not annotated as being cell cycle regulated but exhibited cell cycle–related fluctuations in nucleosome occupancy (E). We limited our focus to unidirectional promoters that exhibited G1-related fluctuations. Limiting the search to G1-related fluctuations ensured greater statistical significance because the time course captured two separate G1 phases. Data from individual unidirectional promoters were tested for correlation to the average G1 FAIRE profile (r > 0.6) and for a minimum change in FAIRE enrichment (z-score change > 1.0) over the entire time course.
Of the 42 promoters that met these criteria, several are upstream of genes for which, upon closer inspection, there exists independent evidence of cell cycle regulation. For example,
SEC53 is highly induced by
CLN3 [
30] whereas the promoters of YKR011C, YPL056C, YOR343C, and YDR222W are bound by G
1 transcription factors [
40,
42] (E). Overall, five of the 42 (11.9%) are bound by G
1 transcription factors, demonstrating an enrichment over the rest of the unidirectional promoters tested (48/2,486, or 1.9%, chi-square
p-value = 7.7 × 10
−6) [
42].
Most of the identified promoters, however, do not yet have reported connections to the cell cycle. For example, the promoters of YPR022C, MSI1, and YPR003C exhibit strong cell cycle–related changes in FAIRE enrichment but are not annotated as being cell cycle regulated, based on fluctuations in RNA levels (E). RNA transcripts from these genes may be subject to post-transcriptional regulation that would prevent their detection, such as rapid RNA degradation. Cell cycle–regulated transcriptional initiation, in combination with a post-transcriptional regulatory mechanism, could be a means for allowing cell cycle expression of genes under specific environmental or developmental conditions.