Recent advances in RNA-sequencing (RNA-Seq) have provided a means for rapid characterization and quantification of transcriptomes. RNA-Seq involves direct sequencing of complementary DNAs (cDNAs) using high-throughput next generation sequencing (NGS) technologies, followed by mapping of the sequencing reads to the reference genome or gene sets for gene expression analysis and polymorphism detection. Compared to other technologies such as hybridization-based microarrays and Sanger sequencing-based methods, RNA-Seq provides a more comprehensive understanding of transcriptome complexity and the ability to detect a dynamic range of expression levels (Marioni et al.,
2008; Wang et al.,
2009; Mader et al.,
2011), allowing for the identification of novel transcripts, small RNAs, SNPs, alternate splicing products, sense and antisense transcripts, fusion transcripts, and can identify transcription initiation sites (Ozsolak and Milos,
2011).
Next generation sequencing platforms used for RNA-Seq are commercially available from Illumina, Roche, ABI, Helicos BioSciences, and more, and companies are continuously improving their platforms to increase sequencing speeds, accuracy, and depth at a lower cost. Cost reduction and high sequencing performance allow for projects such as the 10 million dollar 100 human genomes
1 and the
Arabidopsis 1001 genomes project (Weigel and Mott,
2009). Even though sequencing capacity continues to increase, protocols for sample library preparation, being laborious, time consuming, and expensive, remain a limiting step. Sequencing library preparation involves the production of a random collection of sequence-ready adapter-modified DNA fragments, with a specific range of fragment sizes. Although several procedures to improve on the Illumina RNA-Seq library preparation have been published (Quail et al.,
2008; Nagalakshmi et al.,
2010; Wilhelm et al.,
2010), these protocols still have several laborious steps including ethanol precipitation, column purifications, and gel extraction for size fractionation. In addition to being time consuming, these steps carry a high risk of cross-contamination and sample mix-up inherent in protocols involving extensive individual sample handling. Recently, Illumina introduced a high-throughput method (TruSeq RNA sample preparation kit) replacing these purification steps with solid-phase reversible immobilization (SPRI) magnetic bead reaction cleanup methodology (Hawkins et al.,
1994; Lennon et al.,
2010). Using this method, a single technician can make 96 libraries from total RNA in 3

days. However, the amount of multiplexing is limited to 24 by the number of available barcodes. Similar improvements can also be seen in the protocols by Zhong et al. (
2011) and Wang et al. (
2011a).
Here we present several improvements to the Illumina sample preparation for RNA-Seq protocol (Illumina Inc., San Diego, USA, Cat. # RS-100-0801) that we have made to generate high-throughput and cost-effective RNA-Seq libraries in a more robust and reproducible way, compared with other current protocols. We integrated a direct mRNA extraction method using Dynabeads oligo dT beads (Invitrogen, Carlsbad, CA, USA) or Sera-Mag oligo dT beads (Thermo Scientific, Indianapolis, IN, USA), which are suitable for RNA extraction from various plant and animal tissues. One challenge for scaling-up protocols to 96-well format is the RNA fragmentation step. Specifically, it is difficult to control the degree of chemical fragmentation in RNA owing to the short incubation time, leading to decreased reproducibility, especially in 96-well formats. To overcome this problem we used enzymatic fragmentation of cDNA. We also used the SPRI magnetic bead reaction cleanup methodology to enable handling of samples in a 96-well format, similar to the TruSeq protocol and that of Zhong et al. (
2011). Further, to reduce protocol time and the number of handling steps, we applied an “on beads” protocol (Fisher et al.,
2011) for several enzymatic reactions including end repair, A-tailing, and adaptor ligation. These changes reduce the potential for human error introduced during the sample preparation process. Finally, we developed 96 unique barcoded adapters to provide more flexibility in multiplexing. With these modifications and a few other small adjustments, we have greatly increased the efficiency and reproducibility, and lowered the cost of library preparation (by ~3–11×) in comparison to other currently available methods. Our high-throughput RNA-seq (HTR) library preparation method enables a single researcher to reproducibly make 96 RNA-Seq libraries, starting from tissue, in less than 3

days. Analysis of the sequencing output from our libraries demonstrated that our protocol yields data whose quality matches or exceeds that of the standard Illumina method (IL) by sequence composition, ribosomal RNA contamination, and detection of gene expression.