Novel high-throughput DNA sequencing technologies have allowed the generation of millions of short reads and have empowered a wide variety of studies such as genome-wide analysis of transcriptomes (RNA-Seq) [
1-
4], transcription factor binding sites (ChIP-Seq) [
5,
6] and whole-genome sequencing and analysis [
7,
8]. However, these studies have often been limited by a high cost per sample and low throughput. A typical sequencing run on an Illumina Genome Analyzer II currently costs about $500 in reagents per flowcell lane and requires ~4 days to complete both the sequencing and Illumina analysis pipeline phases. Moreover, the number of mapped reads (up to 10 M per lane) is often significantly higher than required for the experiment, especially for organisms of small genome sizes such as yeasts, worms and flies.
Multiplex DNA sequencing has been pursued since the beginning of Sanger sequencing [
9] and has been applied to Roche's 454 platform [
10]. Here we describe a multiplexing strategy for Illumina sequencing to process multiple DNA samples simultaneously. The strategy was applied to analyze the targets of three yeast DNA binding proteins (Cse4, Ste12 and RNA polymerase II) using chromatin immunoprecipitation (ChIP) and was shown to yield accurate and high quality results. We also included a reference sample for ChIP-Seq termed input DNA.
ChIP followed by high-throughput sequencing (ChIP-Seq) has been developed to map the protein-DNA interactions at a genome-wide level [
5,
6]. It allows characterization of transcription factor binding and other DNA-binding proteins during development [
5,
11], under different environmental conditions [
6,
12] or in different cell types or tissues. ChIP-Seq has also been used to study the epigenome by mapping the distribution of histone modifications and chromatin-modifying complexes [
12-
14]. Combination of multiple ChIP-Seq experiments can help to determine transcriptional networks [
15].
Cse4 is a centromeric variant of histone H3 [
16] and its human homolog is the centromeric protein A (CENP-A) [
17]. Yeast centromeres span 126 base pairs and are divided in three centromeric DNA elements (CDEI, CDEII and CDEIII); Cse4 binds CDEII [
18]. Cse4 participates in the formation of a specialized hexameric nucleosome with histone H4 and Scm3 that diverges from the standard H2A-H2B-H3-H4 octamer [
19-
22]. The kinetochore assembles at the centromere and Cse4 is required for normal kinetochore assembly and function [
23-
26]. Cse4 mutants display strong chromosome missegregation due to incorrect kinetochore structure [
25,
27,
28].
Ste12 is a transcription factor that regulates two sets of genes: those involved in invasive growth (pseudohyphal growth) and those implicated in the mating response (pheromone stimulation) [
29-
31]. Pseudohyphal growth is a polarized invasion of media by
S. cerevisiae upon nitrogen starvation. It integrates signals from a MAP kinase cascade and the cAMP-dependant pathway [
30,
32-
35]. During pseudohyphal growth, Ste12 associates as a dimer with Tec1 on transcription factor binding sites (TFBS) upstream of invasive genes [
33,
36-
38]. During the mating response, Tec1 is phosphorylated by Fus3 upon pheromone stimulation, leading to its degradation and the binding of Ste12 on the pheromone response elements [
39-
41]. Thus, the dual role of Ste12 depends mainly on different phosphorylation events [
42-
46].
RNA polymerase II (PolII) transcribes most protein-coding genes and is conserved among metazoans. It is recruited to a particular transcription start site depending on the chromatin structure and the presence of preinitiation complexes and transcription factors. In budding yeast, the PolII holoenzyme consists of 12 subunits (Rpb1-12) [
47]. Rpb5, Rpb6, Rpb8, Rpb10 and Rpb12 are shared with the two other major RNA polymerases and Rpb5 contacts DNA to promote transcriptional activation [
48,
49]. PolII directly interacts with the mediator complex, histones, transcription factors, elongation factors and many other proteins [
50]. Although yeast RNA polymerase III genome-wide distribution has been studied extensively using microarrays [
51-
53], only one similar study has been performed to characterize PolII distribution across the yeast genome [
54].
Using multiplex ChIP-Seq, we mapped the targets of Cse4, Ste12 and PolII across the yeast genome. We found a large number of binding sites for each factor. Cse4 localized predominantly to centromeres and we also observed other Cse4 binding sites with lower signal intensity elsewhere in the genome. Ste12 and PolII binding sites were found in close proximity to genes.