Dysregulation of physiologic microRNA (miR) activity has been shown to play an important role in tumor initiation and progression, including gliomagenesis (
Gabriely et al., 2011;
Godlewski et al., 2008;
Kim et al., 2010a;
Kim et al., 2011;
Kwak et al., 2011). Therefore, molecular species that can regulate miR activity on their target RNAs, without affecting the expression of relevant mature miRs, may play equally relevant roles in cancer. Yet, few such modulators of miR-activity have been characterized (
Krol et al., 2010;
Poliseno et al., 2010), and both the extent and relevance of their role in controlling normal cell physiology and pathogenesis are poorly understood.
By analyzing a large set of sample-matched gene and miR expression profiles from The Cancer Genome Atlas (TCGA), we show here that the regulation of target genes by modulators of miR activity is surprisingly extensive in human glioma and that it affects genes with an established role in gliomagenesis and tumor subtype implementation. Specifically, we study two types of miR activity modulators with distinct molecular mechanisms ().
Sponge modulators include both messenger RNAs (mRNAs) and noncoding RNAs, which share miR-binding sites with other RNAs targeted by the miR. Thus, these modulators act as miR
sponges or competitive endogenous RNA (ceRNA) via an established titration mechanism (
Arvey et al., 2010;
Ebert et al., 2007;
Poliseno et al., 2010). Depending on their expression levels and on the total number of functional miR binding sites they share with a target, sponge modulators can decrease the number of free miR molecules available to repress other functional targets.
Non-sponge modulators, on the other hand, are implemented by proteins and RNAs acting via a variety of alternative mechanisms, including activation or suppression of miRISC-mediated regulation of target RNAs (
Krol et al., 2010), protection from miR degradation (
Chatterjee et al., 2011), or prevention of miRs from binding their targets (
Eiring et al., 2010). As a result, they do not necessarily share miR-binding sites with their modulated targets. Established sponge-modulators include VCAN (
Lee et al., 2010), PTENP1 (
Poliseno et al., 2010) and CD44 (
Jeyapalan et al., 2011), while non-sponge modulators include miRISC core components, such as the members of the AGO and TNRC6 families (
Krol et al., 2010). Notably, genetic alterations at the PTENP1, AGO2 and TNRC6A loci have all been implicated in tumorigenesis (
Kim et al., 2010b;
Poliseno et al., 2010;
Zhou et al., 2010).
To evaluate both the range and potential tumorigenic role of this class of miR-mediated interactions, we present a new multivariate analysis method, called Hermes. Hermes systematically infers candidate modulators of miR activity from large collections of genome-wide expression profiles of both genes and miRs from the same tumor samples. Hermes extends the functionality of MINDy (modulator inference by network dynamics) algorithm, which uses measurements from information theory to identify genes that modulate transcription factor activity via posttranslational modifications. MINDy has been used to infer post-translational modulators of the MYC transcription factor in human B cells (
Wang et al., 2009b), to infer signaling modulators of all transcription factors in human B cells (
Wang et al., 2009a), and to identify the ubiquitin conjugating ligase HUWE1 as a modulator of N-MYC turnover in neural stem cells (
Zhao et al., 2009).
In essence, MINDy and Hermes make inferences by estimating two quantities from information theory: the Mutual Information (MI) and Conditional Mutual Information (CMI). The MI quantifies how much one variable informs about another variable (i.e., high MI between two variables implies that knowledge about the first variable is predictive of state of the second variable). The CMI calculates the expected value of MI of two variables given the third variable. Specifically, given a modulator (M), a regulator (R), and a regulated target (T), the algorithms dissect the regulatory dependency of these three components by studying the difference between the Conditional Mutual Information (CMI) of the regulator’s expression level and the target’s expression level, (conditional on the expression level of the modulator) and the Mutual Information (MI) of the regulator and target expressions,
ΔI =
I[
R; T|
M] −
I[
R; T] (
Wang et al., 2006). These quantities and their associated statistical significance can be computed from large collections of gene expression profiles (>250 samples), using a variety of estimators for MI and CMI (
Wang et al., 2006), i.e. computational tools that can quantitatively estimate their values.
Hermes expands the MINDy information theoretic framework to identify candidate genes that modulate miR activity (i.e., modulators), whose availability M affects the relationship between the expression of miRs targeting a gene T and its expression profile, T. We use the term miR program to indicate a set of miRs targeting a gene and the term common miR program to indicate the intersection between the miR programs of two distinct genes. Analysis of Hermes-inferred sponge and non-sponge interactions in TCGA glioblastoma data revealed a regulatory network of previously unsuspected size. Experimental validation of 29 such interactions (26 sponge and 3 non-sponge), of which only 3 failed to validate, suggested that Hermes has a low false positive rate and showed that mPR interactions participate collectively in regulation of key drivers of gliomagenesis and tumor subtype, that these interactions mediate cross-talk between independent pathways, and that they affect cell pathophysiology.