|Home | About | Journals | Submit | Contact Us | Français|
Despite their extensive clinical and pathological heterogeneity, all malignant germ cell tumors (GCTs) are thought to originate from primordial germ cells. However, no common biological abnormalities have been identified to date. We profiled 615 microRNAs (miRNAs) in pediatric malignant GCTs, controls and GCT cell lines (48 samples in total) and re-analyzed available miRNA expression data in adult gonadal malignant GCTs. We applied the bioinformatic algorithm Sylamer to identify miRNAs that are of biological importance by inducing global shifts in mRNA levels. The most significant differentially expressed miRNAs in malignant GCTs were all from the miR-371~373 and miR-302 clusters (adjusted p<0.00005), which were over-expressed regardless of histological subtype [yolk sac tumor (YST)/seminoma/embryonal carcinoma (EC)], site (gonadal/extragonadal) or patient age (pediatric/adult). Sylamer revealed that the hexamer GCACTT, complementary to the 2-7 nucleotide miRNA seed AAGUGC shared by six members of the miR-371~373 and miR-302 clusters, was the only sequence significantly enriched in the 3′untranslated region (3′UTR) of mRNAs down-regulated in pediatric malignant GCTs (as a group), YSTs and ECs; and in adult YSTs (all versus non-malignant tissue controls; p<0.05). For the pediatric samples, down-regulated genes containing 3′UTR GCACTT showed significant over-representation of Gene Ontology (GO) terms related to cancer–associated processes, whereas for down-regulated genes lacking GCACTT, GO terms generally represented metabolic processes only, with few genes per term (adjusted p<0.05). We conclude that the miR-371~373 and miR-302 clusters are universally over-expressed in malignant GCTs and coordinately down-regulate mRNAs involved in biologically significant pathways.
Germ cell tumors (GCTs) are clinically and pathologically complex neoplasms that occur from the neonatal period through to late adulthood (1). Benign forms show extensive somatic differentiation and are referred to as mature (MT) and immature (IT) teratoma, while malignant GCTs are classified into seminomas and non-seminomatous tumors [yolk sac tumors (YSTs) and embryonal carcinomas (ECs)] (2).
Despite their heterogeneity, the germ cell theory of tumorigenesis states that all GCTs arise from totipotent primordial germ cells (3). However, biological abnormalities that are conserved across the age and histological range of malignant GCTs have not yet been identified. Here, we investigate the patterns and consequences of microRNA (miRNA) expression across the spectrum of GCTs. miRNAs regulate gene expression via translational repression and mRNA destabilization (4-6), the latter being detectable from mRNA expression changes (7, 8). This regulation is principally determined by the miRNA seed region, which binds to the seed complementary region (SCR) in the 3′ untranslated region (3′UTR) of mRNA targets (9). The seed region comprises nucleotides (nt) 2-8 of the miRNA, with 2-7nt being most critical for binding specificity (10). Importantly, miRNAs play a key role in cancer development, both as oncogenes and as tumor suppressor genes (TSGs) (4, 8, 11-13).
Little information is available concerning miRNA profiles in GCTs. One study used quantitative reverse transcription PCR (qRT-PCR) to determine levels of a restricted range of 156 miRNAs in adult gonadal GCTs, compared to three cases of normal testis (patient age uncertain) (14). This work suggested that the miR-371~373 gene cluster is highly expressed in adult malignant GCTs, a view supported by a genetic screen of primary human cells that implicated miR-372~373 as oncogenes in testicular GCTs (TGCTs), acting through inhibition of large tumor suppressor homolog 2 (LATS2) (15).
To date, there is no published miRNA profiling data for GCTs of pediatric patients, nor of those arising at extragonadal sites in any age-group. It is therefore unknown whether particular changes in miRNA expression represent a fundamental feature of malignant GCTs. The present study had two principal aims. First, we sought to determine global miRNA profiles in pediatric GCTs arising at both gonadal and extragonadal sites, and to compare the changes observed with those reported for adult gonadal malignant GCTs. Second, we applied the bioinformatic algorithm Sylamer (16) to identify miRNA changes that are of biological significance by inducing global shifts of mRNA expression. Our data indicate that the miR-371~373 and miR-302 clusters, of which six members (miR-372~373 and miR-302a~302d) share the identical 2-7nt seed region AAGUGC, have a fundamental role in the pathogenesis of malignant GCTs by down-regulating functionally significant target genes.
The study received Multicenter Research Ethics Committee (ref:02/4/071) and Local Research Ethics Committee (ref:01/128) approval. We performed miRNA expression profiling on 48 samples, representing 32 pediatric GCTs from 22 female and 10 male patients (12 YSTs, 11 seminomas, three ECs, three MTs, three ITs), two testicular seminomas from young adults, eight control samples and six GCT cell lines [CLs; authenticated using short tandem repeat profiling (17); Supplementary Figure S1] (Supplementary Table S1). To avoid confusion with data from our re-analysis of miRNA expression in adult GCTs (14), all of these samples are henceforth referred to as ‘pediatric’. We use ‘seminoma’ to refer to all tumors with seminomatous histology, regardless of site (i.e., testicular seminoma, ovarian dysgerminoma, and extragonadal germinoma) (18, 19). The eight control tissues represented four normal gonadal specimens (one case each of pre- and post-pubertal male and female gonad) and four developmental samples (two fetal yolk sacs and two fetal female gonads).
Total RNA was isolated as described previously (2). Sample and human reference (20) RNA were hybridized to the miRCURY LNA array platform v9.2 (Exiqon, Vedbaek, Denmark). The miRNA GAL file was updated to miRBase v13.01 which annotated 615 probes on the array. The 48 raw miRNA (.txt) data files [Gene Expression Omnibus (GEO) accession number: GSE18155] were processed using the Bioconductor packages limma and ArrayQualityMetrics in R (21, 22). The median expression value of the quadruplicate spots for each miRNA was calculated after subtraction of background intensities. Within-array (global-loess) and between-array (Aquantile method) normalization was performed (23) before a contrast matrix defining all pairwise comparisons was fitted (24). The data were filtered to exclude low variability probes (median expression value inter-quartile-range <0.6) and subsequently used for unsupervised hierarchical clustering, using a distance measure of 1 minus the Pearson correlation coefficient between samples. For heatmaps, values for each probe were centered by subtracting the mean expression value across samples.
Differential expression was assessed using a moderated t-statistic and p-values adjusted for multiple testing using Benjamini and Hochberg’s method (25). miRNAs with adjusted p-values <0.01 were considered statistically significant and differentially expressed. Lists of differentially expressed miRNAs generated for four different comparisons (pediatric malignant GCTs, YSTs, seminomas and EC versus non-malignant control tissue) were subsequently used for Sylamer analysis. Only the most significantly differentially expressed miRNAs (adjusted p<1×10−5) were represented on heatmaps, to enhance visualization of key miRNAs. Taqman qRT-PCR validation of miRNA levels, normalized to RNU24, was performed as previously described (20).
We compared our findings with published miRNA expression data for adult gonadal GCTs, as obtained by qRT-PCR (14). The raw cycle threshold (CT) data file was downloaded from the journal website2. After removal of the four spermatocytic seminoma (SS) samples, which do not occur in the pediatric population, miRNA CT values for the remaining 60 adult tissue samples and five GCT CLs were normalized to let-7a (which displayed the least variable expression across all samples), to obtain Δ CT values. The mean of all ΔCT values for all samples was then subtracted to obtain ΔΔCT values, which were used to perform unsupervised hierarchical clustering analysis and to generate lists of differentially expressed genes, employing the criteria used for the pediatric samples.
Matching global mRNA expression profiles were available for 21 of the 42 pediatric tissue samples examined by miRNA microarray. These represented 17 malignant GCTs (10 YSTs, six seminomas, one EC) and four non-malignant controls, comprising one MT and three normal gonads (one pre- and one post-pubertal testis and one post-pubertal ovary) (Supplementary Table S1). Profiling had previously been performed using the HG-U133A GeneChip (Affymetrix, Santa Clara, CA), comprising 22,283 probe sets corresponding to 13,042 genes. Data for 16 samples had previously been published (2); the EC, MT and three normal controls were previously unreported (GEO accession: GSE18155). In addition, we re-analyzed published data from a study of adult TGCTs that also used the HG-U133A GeneChip [(26); GEO accession: GSE3218], excluding two suboptimal YST samples (K14 and K18) (2). We used data from 25 such specimens, representing eight pure YSTs, 12 pure seminomas and five normal adult testis controls (26).
Raw mRNA (.CEL) files were processed and quantile normalized using Robust Multi-array Average (RMA) in R (6, 21, 27), using the Affymetrix annotation of March 2009. RMA-transformed expression values were analyzed for differential expression (24) with significance studied by t-test and adjusted for multiple testing (25). Pathway enrichment analysis was performed using the Gene Ontology (GO) algorithm3, as it permitted comparison of differentially expressed genes (log2 fold-change <−1.5 and adjusted p<0.01) grouped by the presence or absence of the SCR corresponding to the common 2-7nt seed of the miR-372~373 and miR-302a~d clusters. NCBI Entrez Gene identifiers were evaluated for biological process category over-representation within a total gene universe defined by the HG-U133A annotation library, using the hyperGTest function within the Bioconductor GOstats package (28). GOterms with adjusted p<0.01 (25) were considered statistically significant.
Full details of Sylamer are provided elsewhere4 (16). In brief, the algorithm assesses enrichment and/or depletion of nucleotide words of specific length, complementary to elements of the seed region (nucleotide positions 1-8) of miRNAs (i.e. SCRs), in the 3′UTRs of genes within ranked lists, with significance calculated using hypergeometric statistics. The primary aim of Sylamer is to identify whether changes in miRNA expression are of biological significance, with the secondary aim of producing lists of target genes for further validation.
For each ranked genelist derived from the mRNA expression data, we undertook Sylamer analysis for the six SCR elements of increasing size: three hexamers (corresponding to miRNA seed positions 1-6, 2-7 and 3-8); two heptamers (positions 1-7 and 2-8); and one octamer (position 1-8). Due to over-representation of conserved adenosines flanking SCRs in mRNAs (10), the complementarity criterion was discarded for SCR position 8 (seed position 1), where the nucleotide was always set to be adenosine, irrespective of the actual nucleotide at that position. For each comparison analyzed, the mRNA genelist was first ranked from down-regulated (to the left) to up-regulated (to the right). For each SCR under consideration, an enrichment/depletion p-value was computed at different cut-offs in the ranked gene list. At each cut-off, an SCR was either enriched in the 3′UTRs of the genes to the left and accordingly depleted in the 3′UTRs of the genes to the right, or conversely depleted on the left and enriched on the right. An event of enrichment on one side and corresponding depletion on the other side of the cut-off is associated with a single p-value. Varying the cut-off resulted in a set of p-values for each SCR (y-axis) visualized on a landscape plot (16), in which the log10-transformed p-values were sign-adjusted and plotted against the ranked genelist (x-axis). Sign-adjustment depended on the specific enrichment/depletion status of the pertinent SCR. A point plotted along the positive y-axis signifies that the SCR is enriched in the genes to the left and depleted in the genes to the right, whereas a point plotted along the negative y-axis conversely signifies depletion to the left and enrichment to the right. The displacement along the y-axis identifies the significance of the joint enrichment/depletion p-value for the SCR at that cut-off, according to the sign-adjusted log10-transformation.
For the present study, we combined Sylamer significance scores for different elements of each SCR to obtain a single summed significance score for each group of miRNAs sharing the same seed region. To do this, miRNAs were assigned to groups, defined by a common seed, and a single score was produced for the combined Sylamer results for the set of SCR hexamers, heptamers and octamer particular to each group. This approach integrated signals from different word lengths and increased method sensitivity compared to standard Sylamer analysis. The same analysis was applied to all possible words of 8nt length (all with an adenosine at position 8), including those that do not represent SCRs. The resulting scores followed an extreme value distribution by the nature of the scoring criteria employed. By fitting this distribution, p-values were assigned to the scores, with values <0.01 considered to be significant. As a filtering step, we only considered miRNA groups that contained at least one significantly differentially expressed miRNA.
In initial unsupervised hierarchical clustering analysis, normalized microarray expression data for the 246 miRNAs that showed variable expression in the 48 pediatric samples and CLs were used to generate a heatmap (Figure 1). The dendrogram divided into two main branches, one containing the pediatric malignant GCT tissues and CLs, the other containing the non-malignant tissue, i.e. the teratomas (MT and IT) and the normal and developmental control samples. Only one non-malignant sample clustered with the malignant GCTs - MT tissue (MT-34) from a mixed GCT that also contained a malignant element. The pediatric malignant GCTs subdivided principally by histological subtype, with dendrogram subdivisions comprising seminomas, CLs, YSTs and ECs. The non-malignant samples subdivided into two branches, with developmental controls (fetal yolk sac and gonads) in one branch and normal gonadal tissue (pre- and post-pubertal ovary and testis) with the teratomas in the other.
In further analyses of our dataset we focused on the 42 tissue specimens only, removing the six CLs. No miRNA showed significant differential expression (adjusted p<0.01) between the MT and IT samples. When comparing the six teratoma samples and eight normal control specimens, only two of the 615 miRNAs were differentially expressed, with p-values that only just reached significance (miR-9 and miR-9*; adjusted p=0.0098 and 0.0091 respectively). Consequently, all 14 non-malignant samples were combined for subsequent comparisons with the pediatric malignant GCT tissues. Comparing all malignant GCTs versus the non-malignant samples produced a list of 170 significantly differentially expressed miRNAs, of which 44 (25.9%) were over-expressed in malignant GCTs and 126 (74.1%) under-expressed (Supplementary Table S2A). A heatmap based on the most significantly differentially expressed miRNAs (adjusted p<1×10−5; n=65) showed complete segregation between the pediatric malignant GCT and non-malignant samples (Supplementary Figure S2). The top 10 differentially expressed miRNAs in this comparison are shown in Table 1A. The top nine were from just two miRNA clusters - miR-371~373 and miR-302 (the latter including miR-367) - and all were over-expressed in the malignant GCTs.
Our re-analysis of published qRT-PCR profiling of miRNA expression in adult gonadal GCTs is described in Supplementary Results (Supplementary Figure S3, Supplementary Table S2B). The top 10 differentially expressed miRNAs (Table 1A) were exclusively over-expressed miRNAs from the miR-371~373 and miR-302 clusters. Due to the different platforms used, data for the pediatric and adult samples could not be compared directly. However, parallel plots of expression values for each paediatric and adult specimen of the eight main members of the miR-371~373 and miR-302 clusters [i.e. non-miR* sequences (29)] confirmed differential expression between the malignant GCTs and the non-malignant (teratoma and control) samples (Figure 2A). Expression patterns were similar within histological subtypes, regardless of patient age.
Hierarchical clustering analysis using just the eight main members of the miR-371~373 and miR-302 clusters showed complete segregation of malignant GCTs from non-malignant samples for both the pediatric and adult data [a single outlier (an EC sample) for the latter notwithstanding (Figure 2B)]. Interestingly, six of these eight miRNAs (miR-372~373 and miR-302a~302d) share a common key 2-7nt seed region AAGUGC (Table 1B), which corresponds to the SCR hexamer GCACTT in mRNA targets. There was no evidence that expression of these miRNA clusters was DNA copy number driven in pediatric malignant GCTs, as assessed by 1Mb interval array-based comparative genomic hybridization (30, 31) (data not shown). For details of miRNA expression profiles in malignant GCT subtypes and CLs, and associations between miRNA expression and tumor site, see Supplementary Results, Supplementary Tables S2-S4 and Supplementary Figures S4/S5.
To identify transcription factors (TFs) that may be responsible for miR-302 and miR-371~373 cluster over-expression, we examined gene expression profiles in our pediatric and the published adult (26) GCT datasets. For this screening exercise, we applied less stringent criteria of log2 fold-change >1.0 and adjusted p<0.01. We identified six TFs that were over-expressed in malignant GCTs overall, and in at least one malignant subtype analysis, in both the pediatric and adult datasets (Table 1C). While NANOG, POU5F1, TFAP2C and SOX15 were specifically over-expressed in seminomas, SOX17 and TEAD4 were over-expressed in both seminomas and YSTs. The pediatric EC over-expressed all TFs except SOX17. For the 21 pediatric samples for which matched miRNA and mRNA expression data were available, linear regression analysis showed a positive correlation between the median expression value for the eight main miRNAs from the miR-371~373 and miR-302 clusters and expression levels of SOX17 and TEAD4 (p<0.0005 and p=0.06, respectively) (Figure 3A).
Having validated expression levels of selected miRNAs using qRT-PCR (Supplementary Results, Supplementary Figure S5), we next performed Sylamer analysis on complete mRNA genelists ranked according to differential expression between pediatric malignant GCTs and non-malignant tissues. We focused on SCRs corresponding to seeds in miRNAs that we had identified as being differentially expressed, as the majority of pediatric malignant GCT samples in which miRNA profiling had been performed had matched mRNA profiling data. For further details of the analysis model, and the demonstration of the advantages of miRNA target identification using Sylamer rather than standard prediction methods, see Supplementary Results and Supplementary Figures S6/S7.
We observed that the SCR hexamer GCACTT, complementary to the common 2-7nt seed region AAGUGC of miR-372~373 and miR-302a~302d, was the most enriched in the mRNAs down-regulated in pediatric malignant GCTs (single p-value for all SCR elements of different length=3.81×10−4), YSTs (p=4.96×10−5) and ECs (p=7.18×10−4), compared to non-malignant tissues (Figure 4). No enrichment for this common SCR was seen when pediatric seminomas were compared with controls. Similar analysis of published data for adult TGCTs (26) also revealed that the common SCR hexamer GCACTT was the most enriched in genes under-expressed in adult testicular YSTs versus normal adult testis (single p-value for all SCR elements=1.69×10−3) (Figure 4). This hexamer showed non-significant enrichment in down-regulated genes when considering all adult TGCTs (p=0.09) (Figure 4) and adult testicular seminomas compared to normal adult testis. No other SCRs corresponding to the seeds of significantly over-expressed miRNAs were over-represented (p<0.01) in the 3′UTRs of genes under-expressed in malignant GCTs in either the pediatric or adult datasets. For Sylamer analysis of global shifts in mRNA profiles corresponding to miRNAs under-expressed in malignant GCTs, see Supplementary Results and Supplementary Figure S8.
Of 120 mRNAs that we identified as significantly down-regulated in pediatric malignant GCTs versus non-malignant tissue samples, transcript and 3′UTR information were available for 102. Sylamer identified that while the common 2-7nt SCR GCACTT was present in the 3′UTR of 17.4% (n=30) of the 172 up-regulated mRNAs in this comparison (log2 fold-change >1.5 and adjusted p<0.01) and a similar percentage of all 13,042 genes covered by the Affymetrix U133A GeneChip (16.3%; n=2,125), it was enriched in the 102 down-regulated mRNAs, being present in 41 (40.2%) (Table 2A). The 41 mRNAs showed significant over-representation of the GO terms ‘regulation of cellular and biological processes’, ‘intracellular signaling cascade’ and the three related terms ‘regulation of GTPase activity’, ‘regulation of Ras protein signal transduction’ and ‘regulation of small GTPase signal transduction’ (Table 2B). In contrast, for the remaining 61 down-regulated mRNAs in which the common SCR was absent, the over-represented GO terms were generally related to a small number of metabolic processes only, with small numbers of genes per term (Table 2C).
Of the 41 mRNAs containing the common 2-7nt SCR in the pediatric comparison, we identified 22 that were also present in the corresponding adult genelist (and therefore most likely to be direct targets of the miR-372~373 and miR-302a~302d families), including numerous cancer-associated genes (Table 2D, Supplementary Results and Supplementary Table S5). Linear regression analysis showed significant negative correlations for 21 of the 22 genes between expression levels and the median expression value for the six miRNAs from the miR-371~373 and miR-302 clusters that contain the common 2-7nt seed AAGUGC, using data from the 21 pediatric samples with matched miRNA and mRNA expression data (Figure 3B and Supplementary Figure S9).
Similar observations were made for mRNAs down-regulated in pediatric YSTs versus non-malignant tissue and for mRNAs down-regulated in pediatric seminomas versus non-malignant tissue. For both comparisons, the common 2-7nt SCR was enriched in down-regulated genes, being present in 37.3% and 32.1% respectively (Table 2A). Likewise, GO terms for SCR-containing mRNAs included a range of cancer-associated processes, while GO terms for mRNAs without the SCR generally represented metabolic processes only (Supplementary Tables S6/S7). For details of enrichment for the common 2-7nt SCR GCACTT in down-regulated mRNAs in adult TGCTs and analogous GO analysis, see Supplementary Results and Supplementary Table S8. For genes down-regulated in YSTs versus non-malignant samples and seminomas versus non-malignant samples for the pediatric and adult datasets, see Supplementary Results and Supplementary Tables S9/S10, respectively.
In this study we have demonstrated that the majority of miRNAs differentially expressed in pediatric malignant GCTs are down-regulated, as has been observed for other types of malignancy (32). Nevertheless, the most significant differential expression was up-regulation of the miR-371~373 and miR-302 clusters, regardless of histological subtype, tumor site (ovary, testis or extragonadal) or patient age. Over-expression of these miRNAs appears to be specific to malignant GCTs, with no similar findings for other malignancies or diseases to date, save for miR-372~373 over-expression in an isolated case of an exceptionally rare embryonal brain tumor, at much lower levels than in malignant GCTs (33).
The miR-371~373 cluster was previously reported to be over-expressed in adult gonadal malignant GCTs, based on qRT-PCR (14) and RNase protection assay (15). However, these reports are inconsistent regarding expression of the miR-302 cluster in such tumors. One stated that miR-302a~302d expression was undetectable in many miR-371~373-expressing malignant GCTs, consistent with miR-371~373 over-expression being a selected event in malignant GCT development (15). In contrast, the other appears to illustrate over-expression of miR-302a~302d in all adult gonadal malignant GCTs, although this was not explicitly commented on (14). Our re-analysis of the published qRT-PCR data shows that the miR-302 cluster is as significantly over-expressed as miR-371~373 in adult gonadal malignant GCTs compared to non-malignant tissues (teratomas and controls) (Table 1A), mirroring our observation for pediatric gonadal and extragonadal malignant GCTs. As both miRNA clusters are believed to be ESC-specific pluripotency markers (34-38), our findings suggest that expression of the miR-371~373 and miR-302 clusters in malignant GCTs either represents persistence of an embryonic pattern of miRNA expression that is not present in normal tissues and teratomas (the latter having undergone somatic differentiation), or acquired re-expression, regulated by an as yet undetermined mechanism. We observed associations between miR-371~373/miR-302 levels and TF over-expression, warranting future investigations of their functional relationships, which may be complex. For example, NANOG and POU5F1 have binding sites in the miR-302 (35, 39, 40) and miR-371~373 (40) cluster promoter regions, while POU5F1 is negatively regulated by miR-145 (41) which is significantly down-regulated in both pediatric and adult malignant GCTs (Supplementary Table 2A/B). Our observation of SOX17 over-expression in seminoma and YST, but not in an EC sample, is consistent with previous reports (42-44).
Our Sylamer analysis strongly suggests that over-expression of the miR-371~373 and miR-302 clusters is functionally important in malignant GCTs by globally affecting levels of target mRNAs. We observed significant enrichment of the SCR hexamer GCACTT (complementary to the common miR-372~373 and miR-302a~302d 2-7nt seed AAGUGC) in genes under-expressed in pediatric malignant GCTs, YSTs and EC versus non-malignant controls and in adult YSTs versus testicular controls. Sylamer did not identify over-representation of SCRs corresponding to other over-expressed miRNAs. Nevertheless, such miRNAs may contribute to the clinicopathological heterogeneity of malignant GCTs, by targeting a smaller, more discrete number of mRNAs. For example, each subtype of pediatric malignant GCT showed specific abnormalities of miRNA expression (such as over-expression of miR-182~183 cluster in seminomas, miR-375 in YSTs and miR-515~526 cluster in ECs) and we observed significant differential miRNA expression in intracranial versus extracranial seminomas. Interestingly, however, the striking differences in mRNA expression that we previously observed between pediatric and adult malignant GCTs (2) were not reflected by similar differences in miRNA expression profiles.
Using GO analysis, we demonstrated that for pediatric malignant GCTs, and their main subtypes YST and seminoma, the down-regulated mRNAs containing the SCR corresponding to the common miR-372~373/miR-302a~302d seed mediate cellular processes important in oncogenesis and malignant progression (signal transduction, cell cycle, development and morphogenesis, etc.), in contrast to the small number of metabolic processes identified for down-regulated mRNAs without the common SCR. Together, these findings indicate the generalized functional significance of miR-372~373 and miR-302a~302d in the biology of malignant GCTs. Interestingly, these miRNA clusters, via their common 2-7nt seed AAGUGC, are known to be essential for regulating G1-S transition and promoting rapid proliferation in embryonic stem cells (45, 46). Our data further support the use of GO enrichment analysis to identify groups of genes targeted by the same miRNA seed that share a biological function (47). Of note, considerably weaker signals were obtained from the Sylamer and GO enrichment analysis of the adult mRNA dataset (26), in which controls were normal adult testis samples only, leading to large numbers of differentially expressed genes being related to male reproduction and spermatogenesis. We obtained a more tractable list of differentially expressed genes by selecting a range of control tissues containing normal germ cells at different developmental stages.
In conclusion, our data indicate that the miR-371~373 and miR-302 clusters are universally over-expressed in malignant GCTs and are of functional significance by down-regulating mRNAs involved in biologically significant pathways. It will now be important to translate our findings clinically. The miRNA expression changes we describe may improve tumor diagnosis and post-treatment monitoring, and enable novel therapeutic approaches that target fundamental abnormalities of malignant GCT cells.
Financial support: Medical Research Council Clinical Research Training Fellowships [RDP and MJM], Glaxo-SmithKline Postdoctoral Fellowship [HKS] and Wellcome Trust Sanger Institute Postdoctoral Fellowship [CA-G], with further support from the Medical Research Council, Cancer Research UK, CLIC Sargent, Parthenon Trust, and Addenbrooke’s Charitable Trust.