Here, we developed an efficient combined computational and biological approach to predict and to prioritize cancer microRNAs for biological investigation. We demonstrated this strategy as an effective economical alternative to comprehensive microRNA analysis in cancers such as HNSCC for which prior genomic array datasets (mRNA or microRNA) are less abundant. This approach also allowed the identification of functional gene targets of the deregulated microRNAs that would otherwise require paired profiling of mRNA and microRNA expression for which the feasibility is often limited by the additional costs, or by the lack of access to the tissue. Employing this method that integrates the analysis of microRNA target predictions, differential HNSCC gene expression and the cancer genes in the OMIM genetic dataset, we identified and characterized miR-204, located within its host gene TRPM3 at the 9q21.1–q22.3 region frequently incurring allelic loss
[11]–
[15], as a potential tumor suppressor microRNA of HNSCC and possibly of other epithelial cancers. The high propensity of LOH at 9q21.1–q22.3 that occurs in 37% HNSCC pre-malignant conditions, further increases to 67% in cancer state
[14] suggesting the presence of tumor suppressor gene candidates. While tumor suppressor genes at other frequent allelic loss loci in HNSCC have been identified, gene candidates responsible for the tumor suppressor activity associated with the 9q21 locus remain elusive. Here, we provided a plausible mechanism that loss of tumor suppressor function of miR-204 as a result of allelic imbalance at 9q21.1–q22.3 may significantly increases the genetic susceptibility to HNSCC oncogenesis and progression. LOH at this locus is also seen in the squamous cell carcinoma (SCC) of the esophagus
[42] and SCC of the lung
[43] suggesting a common somatic genetic lesion underlies the development of SCC of diverse tissue origin. The highly coordinated and nearly complete suppression of miR-204 and its host gene TRPM3 () raises the possibility that TRPM3 mRNA expression may serve as a marker to indicate miR-204 expression status in HNSCC or other tumors, and also potentially LOH at 9q21.1–q22.3. Since a small variation in the expression of a specific microRNA is expected to affect the expression of tens or hundreds of target mRNAs, genetic variations in a microRNA expression at the chromosomal break point, as we observed with miR-204 at the 9q21.1–q22.3 locus, could represent an effective mechanism of cancer predisposition, a hypothesis that is supported by emerging experimental evidences
[44],
[45]. A few recent studies have reported genome-wide microRNA expression changes using HNSCC cancer cell lines
[46]–
[49] or tumor tissues
[49]–
[51]. While similar miR-204 downregualtion was reported in head and neck cancer cell lines based on microarray analysis
[46],
[48], its expression status was not further confirmed by PCR or other methods and its biological functions were not explored. Additionally, since its identification
[24] biological characterization of miR-204 functions in normal development remain limited. Thus far, miR-204 was implicated in affecting global mRNA expression levels in the retina
[52]; and was shown to regulate mesenchymal progenitor cell differentiation
[53].
Through enrichment analysis and network modeling using mRNA gene expression profile, we identified a set of functionally related miR-204 targets that showed increased mRNA expression in HNSCC upon miR-204 suppression (). The presence of miR-204 binding sites (Table 8 in
Text S2), the coordinated up-regulation and the ability of increased miR-204 function to specifically inhibit the expression of 18 out of 21 gene targets (86%) (Figure 2D, Supporting Figures 4–5 in
Text S1) suggest that these predicted genes are very likely selective and direct miR-204 targets in HNSCC. This finding is consistent with the genome-wide association between microRNA binding sites and the ability of corresponding targeting microRNAs to alter their gene expression
[54]. This is the first report of a large set of functionally related cancer microRNA targets that was identified via high throughput computational approaches and confirmed biologically. In addition, the joint analyses of sequence-base information and mRNA expression arrays yielded an accuracy rate of 86% of miR-204 target predictions which surpasses the published accuracy (about 40%) of each sequence-based method when used alone
[10],
[55],
[56].
More broadly, we demonstrated a computational framework for predicting altered regulatory networks and biological functions associated with differentially expressed microRNA targets. Indeed, our combined systems biology approach uncovered previously unknown connections between microRNA regulation, network topology, and expression dynamics for which we obtained thorough biological validations. While genome-scale analyses of interactions among microRNA gene targets in the context of a cellular or protein-protein interaction networks have been conducted computationally
[57]–
[59], such methods and observations await biological confirmation. Here we significantly extended the observations of two recent reports on network modeling
[31],
[32] and demonstrated the feasibility and validity of deploying statistical and bioinformatics approaches to derive regulatory networks corresponding to altered expression of proteins targeted by microRNAs (). Further, combining functional enrichment analysis with network modeling leads to the unbiased prioritization of an EGFR-dependent protein regulatory network connected via up-regulated gene targets of microRNAs in human HNSCC (). Topological analyses of hub and bottleneck properties further identified key regulatory proteins within the EGFR network (). miR-204 appeared critical to regulate the function of this “prioritized HNSCC PPIN” as its gene targets exhibited significant enrichment of hub and bottleneck properties (). Since the EGFR network was derived from overexpressed genes in HNSCC, the functional enrichment of its 56 proteins suggests their positive regulation of cell cycle, cell/matrix adhesion and extracellular matrix modeling. Using this approach, the biological effect of altering the function of a specific microRNA, such as miR-204, can be accurately predicted via its gene targets that are key regulators of a protein network. Accordingly, enhancement of miR-204 function inhibited the expression of its functionally related gene targets (Figure 2D, Supporting Figures 4–5 in
Text S1) in the “prioritized HNSCC PPIN” and lead to the reduced adhesion, migration and invasion
in vitro () and experimental lung metastasis
in vivo (). Further, the strong association of overexpression of functional miR-204 gene targets with an earlier relapse in a sub-type of HNSCC tumors expressing an EGFR-pathway signature () suggests that miR-204 expression and its deregulated gene targets could be potentially used for mechanism-based prognostic stratification of HNSCC patients to complement the conventional clinical-pathological tumor diagnosis. In fact, the feasibility of employing microRNA as sensitive and informative biomarkers for molecular diagnosis has recently been demonstrated
[60].
Collectively, these findings show that single protein network modeling and statistical functional enrichment of a PPIN can illuminate altered complex biological processes and regulatory pathways associated with microRNA dysfunction in cancer with high precision. Complementary approaches have been developed to analyze gene expression changes in the molecular and biological context for candidate gene prioritization and for deriving mechanistic understandings that are most relevant to cancer biology
[61]–
[64]. The system's properties and microRNA-regulated molecular networks we discovered could be exploited for the design of “network mechanism”-based therapies to specifically restore tumor suppressor microRNA functions as an alternative to the single-gene target paradigm and merits further investigation.