Gene expression in eukaryotes is extensively controlled at the post-transcriptional level by RNA-binding proteins (RBPs) and ribonucleoprotein complexes (RNPs) modulating the maturation, stability, transport, editing and translation of RNA transcripts (
Martin and Ephrussi, 2009;
Moore and Proudfoot, 2009;
Sonenberg and Hinnebusch, 2009). Vertebrate genomes encode several hundred RBPs (
McKee et al., 2005), each containing one or more domains able to specifically recognize target transcripts. Furthermore, hundreds of microRNAs (miRNAs) bound by Argonaute (AGO/EIF2C) proteins mediate destabilization and/or inhibition of translation of partially complementary target mRNAs (
Bartel, 2009). To understand how the interplay of these RNA-binding factors affects the regulation of individual transcripts, high resolution maps of
in vivo protein-RNA interactions are necessary (
Keene, 2007).
A combination of genetic, biochemical and computational approaches are typically applied to identify RNA-RBP or RNA-RNP interactions. Microarray profiling of RNAs associated with immunopurified RBPs (RIP-Chip) (
Tenenbaum et al., 2000) defines targets at a transcriptome level, but its application is limited to the characterization of kinetically stable interactions and does not directly identify the RBP recognition element (RRE) within the long target RNA. Nevertheless, RREs with higher information content can be derived computationally from RIP-Chip data, e.g. for HuR (
Lopez de Silanes et al., 2004) or for Pumilio (
Gerber et al., 2006).
More direct RBP target site information is obtained by combining
in vivo UV crosslinking (
Greenberg, 1979;
Wagenmakers et al., 1980) with immunoprecipitation (
Dreyfuss et al., 1984;
Mayrand et al., 1981) followed by the isolation of crosslinked RNA segments and cDNA sequencing (CLIP) (
Ule et al., 2003). CLIP was used to identify targets of the splicing regulators NOVA1 (
Licatalosi et al., 2008), FOX2 (
Yeo et al., 2009) and SFRS1 (
Sanford et al., 2009) as well as U3 snoRNA and pre-rRNA (
Granneman et al., 2009), pri-miRNA targets for HNRNPA1 (
Guil and Caceres, 2007), EIF2C2/AGO2 protein binding sites (
Chi et al., 2009) and ALG-1 target sites in C. elegans (
Zisoulis et al., 2010). CLIP is limited by the low efficiency of UV 254 nm RNA-protein crosslinking, and the location of the crosslink is not readily identifiable within the sequenced crosslinked fragments, raising the question of how to separate UV-crosslinked target RNA segments from background non-crosslinked RNA fragments also present in the sample.
Here we describe an improved method for isolation of segments of RNA bound by RBPs or RNPs, referred to as PAR-CLIP (Photoactivatable-Ribonucleoside-Enhanced Crosslinking and Immunoprecipitation). To facilitate crosslinking, we incorporated 4-thiouridine (4SU) into transcripts of cultured cells and identified precisely the RBP binding sites by scoring for thymidine (T) to cytidine (C) transitions in the sequenced cDNA. We uncovered tens of thousands of binding sites for several important RBPs and RNPs and assessed the regulatory impact of binding on their targets. These findings underscore the complexity of post-transcriptional regulation of cellular systems.