A distinguishing feature of metazoan genomes is the abundance of noncoding RNA (ncRNAs), which function by means other than directing the production of proteins. In addition to small regulatory RNAs such as miRNAs, recent studies have predicted the existence of long ncRNAs-- ranging from 300 nucleotides (nt) to over 10 kb-- that are spliced, polyadenylated, and are roughly as diverse in a given cell type as protein-coding mRNAs (
Bertone et al., 2004;
Carninci et al., 2005;
Kapranov et al., 2005;
Rinn et al., 2003). Long ncRNAs may have diverse roles in gene regulation, especially in epigenetic control of chromatin (
Bernstein and Allis, 2005). Perhaps the most prominent example is silencing of the inactive X chromosome by the ncRNA XIST. To normalize the copy number of X chromosomes between male and female cells, transcription of XIST RNA from one of the two female X chromosome is involved in recruiting Polycomb group proteins (PcG) to trimethylate histone H3 on lysine 27 (H3K27me3), rendering the chromosome transcriptionally silent (
Plath et al., 2003). It is believed that Polycomb Repressive Complex 2 (PRC2), comprised of H3K27 histone methyl transferase (HMTase) EZH2 and core components Suz12 and EED, initiates this histone modification and subsequently Polycomb Repressive Complex 1 (PRC1) maintains this modification and promotes chromatin compaction (reviewed by (
Sparmann and van Lohuizen, 2006). Presently, the mechanism by which XIST ncRNA guides Polycomb activity is unclear. Several PcG proteins possess RNA binding activity, and RNA is required for PcG binding to DNA, suggesting that specific ncRNAs may be critical interfaces between chromatin remodeling complexes and the genome (
Bernstein et al., 2006;
Zhang et al., 2004).
In addition to dosage compensation, long ncRNAs may also play critical roles in pattern formation and differentiation. In mammals, thirty-nine HOX transcription factors clustered on four chromosomal loci, termed HOXA through HOXD, are essential for specifying the positional identities of cells. The temporal and spatial pattern of HOX gene expression is often correlated to their genomic location within each loci, a property termed colinearity (
Kmita and Duboule, 2003;
Lemons and McGinnis, 2006). Maintenance of HOX expression patterns is under complex epigenetic regulation. Two opposing groups of histone modifying complexes, the trithorax group (TrxG) of H3K4 HMTase and the PcG H3K27 HMTase, maintain open and closed chromatin domains in the HOX loci, respectively, over successive cell division (
Ringrose and Paro, 2007). Transcription of many ncRNAs has been observed in fly, mouse, and human HOX loci (
Bae et al., 2002;
Bernstein et al., 2005;
Carninci et al., 2005;
Drewell et al., 2002;
Sessa et al., 2006), and three models have been proposed to account for their action based on experiments in
Drosophila. First, elegant genetic studies suggested that transcription of ncRNAs altered the accessibility of DNA sequences important for TrxG and PcG binding; the act of intergenic transcription enabled TrxG activation of downstream HOX genes and prevented PcG-mediated silencing (
Ringrose and Paro, 2007;
Schmitt et al., 2005). Second, the above model has been extended by a recent report that several ncRNAs transcribed 5′ of the Drosophila Hox gene
Ubx bind to and recruit the TrxG protein Ash1 to the
Ubx promoter, thereby inducing
Ubx transcription (
Sanchez-Elsner et al., 2006). However, these results have been challenged by the third model of “transcriptional interference”, where transcription of 5′ ncRNAs into the promoters of downstream Hox genes prevents Hox gene expression, leading to transcriptional silencing
in cis (
Petruk et al., 2006). The extent to which any of these models and alternative mechanisms explain the copious amount of transcription in mammalian HOX loci remain to be discovered. Nonetheless, the large number of HOX ncRNAs, their complex clustering on the chromosomes, and potentially diverse modes of action suggest ncRNAs play a significant role in HOX regulation. By profiling the entire transcriptional and epigenetic landscapes of ~500 kilobase HOX loci, at near nucleotide resolution, we will begin to discern competing models of ncRNA action in humans and reveal potentially new mechanisms of ncRNA function.
Transcriptomic and proteomic analysis of the HOX loci requires pure cell populations with distinct positional identities. Rather than study whole animals where cells of many histologic types and positional identities are intermixed, we and others have observed that primary adult human fibroblasts retain many features of the embryonic pattern of HOX gene expression both
in vitro and
in vivo (
Bernstein et al., 2005;
Chang et al., 2002;
Rinn et al., 2006). Differential and colinear expression of HOX genes in adult fibroblasts faithfully reflects their position along the anterior-posterior and proximal-distal axes of the developing body (
Rinn et al, 2006), and is believed to be important for maintenance of regional identities of skin throughout the lifetime of the animal (
Chuong, 2003). The remarkable persistence --over decades-- of the embryonic patterns of HOX gene expression in these human cells suggest the action of a powerful epigenetic machinery operative over the HOX loci. In this study, we create an ultra-high resolution tiling microarray to interrogate the transcriptional and epigenetic landscape of the HOX loci in a unique collection of primary human fibroblasts with eleven distinct positional identities. Our results identify numerous novel human HOX ncRNAs, clarify potential mechanisms of their regulation, and reveal a novel mechanism of ncRNA-assisted transcriptional silencing via the PcG proteins
in trans.