Regulation of mRNA levels is a key mechanism that defines cell identity. Cellular homeostasis requires stable gene expression patterns, while differentiation events in metazoan development or responses to external stimuli involve resetting of the transcriptional program. During the lifespan of an mRNA from transcription over maturation, export, translation, and decay, its activity and abundance is controlled by various mechanisms: histone modifications and DNA methylation determine the epigenetic state of the chromatin environment of a gene depending on the DNA accessibility the transcription machinery can bind and initiate transcription and thereby produce primary transcript at different rates (
Segal and Widom, 2009;
Bell et al, 2010). This is modulated co-transcriptionally by splicing and poly-adenylation (
Millevoi and Vagner, 2010;
Nilsen and Graveley, 2010;
Di Giammartino et al, 2011) and further regulated at the level of nuclear export. Once the mRNA is in the cytoplasm it is subject to further post-transcriptional processing, which can reduce the transcript level in a targeted manner. Two major post-transcriptional regulatory processes influencing the amount of mRNA molecules available for translation are general RNA decay and microRNA-mediated RNA interference.
Single-gene experiments have provided examples of the involved regulatory mechanisms that include transcription factor binding but also what is currently referred to as epigenetic regulation. These summarize chromatin regulation of DNA accessibility through active or repressive histone modifications (
Kouzarides, 2007) or nucleosomal positioning (
Kornberg and Lorch, 1999;
Wyrick et al, 1999), transcriptional repression by DNA methylation of gene promoters (
Bird, 2002;
Eckhardt et al, 2006;
Weber et al, 2007) and post-transcriptional regulation of RNA decay rates by non-coding small RNAs (
Ambros, 2004). Additionally, genome-wide studies successfully approximated mRNA levels with information of transcription factor binding and histone modification patterns at promoter proximal sequences (
Ouyang et al, 2009;
Karlic et al, 2010;
Cheng and Gerstein, 2011).
mRNA abundance, however, may be determined to different degrees by transcriptional and post-transcriptional events and the contribution of these layers may vary depending on how stable or how fast the expression change needs to be. At a quantitative level, there is only a limited understanding of the individual contributions of these regulatory layers. To understand these relationships, we abstract the many layers into two processes: primary regulation of synthesis or transcription on the level of chromatin and secondary, post-transcriptional degradation of mRNA. We assume that the change of mRNA level (dR/dt) depends linearly on mRNA synthesis and degradation,
where [RNA
j] is the RNA concentration for gene
j, [DNA] is constant ([DNA]=1),
txj is the transcription rate, and
dj is the degradation rate of gene
j. For simplification, we initially assume the degradation rate to be constant, meaning independent of gene
j. Therefore in steady state where d
R/d
t=0, the RNA concentration of gene
j is proportional to transcription and degradation rates of gene
j. Subsequently when we investigate the contribution of post-transcriptional regulation, we allow
dj to depend on gene
j (see
Supplementary information section 1 for details). Consequently, we can estimate the individual contribution of transcription and mRNA degradation, or mRNA decay, by correlating them with mRNA levels, respectively.
Here, we explore quantitatively how a prediction of transcription based on chromatin characteristics relates to mRNA levels and how such an approach can quantify changes in mRNA abundance that occur during the course of cellular differentiation. We ask if pluripotent and differentiated cells differ in their regulatory behaviors, potentially relating to differences in cell cycle and the ability to set and propagate epigenetic marks or a different usage of post-transcriptional processes. As a biological model, we use mouse stem cells that we differentiate into a highly pure neuronal population through a defined progenitor state (
Bibel et al, 2007). We focus our analysis on pluripotent embryonic stem (ES) cells and postmitotic glutamatergic neurons (TN). To quantitate the contribution of different regulatory processes to observed mRNA levels, we created a linear model for each cell type based on various measures from transcriptional and post-transcriptional layers. In these models, a measure that is a strong correlate of transcription is expected to be highly predictive of mRNA levels. We found that genome-wide measures of histone modifications and polymerase occupancy alone—measures which stand for the transcriptional layer of regulation—allowed accurate prediction of mRNA levels and explained most of the observed experimental variation in steady-state mRNA levels. In addition, we measured transcript half-life and microRNA abundance in these cells, representing the post-transcriptional layer of regulation, and identified only a minor contribution to the determination of mRNA levels.