Sequencing has shown that SNPs in miRNA coding genes, and specifically in miRNA seed regions, are rare
27,30 and would at first glance seem to be of limited functional importance. On close inspection, however, it is apparent that such rare occurrence is the result of well-documented evolutionary selective pressures
27,31 (
BOX 1).
Box 1. Evolutionary selective pressures on mir-SNPsFrom the seminal findings by Ambros
150, Ruvkun
151 and colleagues that
lin-4 (REF.
150) and the 3′ untranslated region of its target, lin-14 (REF
151), had blocks of conservation across
Caenorhabditis elegans species, cross-species sequence conservation has become part of the standard used in prediction of both microRNA (miRNA) genes and their target sites. The widespread understanding that has stemmed from these and subsequent studies is that the functional importance of these sequences leads to selective pressure that limits the frequency of deleterious alleles. This is in fact the case, as single nucleotide polymorphism (SNP) density in miRNA genes and miRNA target sites has been found to be lower in human miRNA loci
27, as well as predicted miRNA binding sites, particularly seed sites, than their flanking regions
30. SNPs in miRNA seed regions were predicted to occur at a frequency of 1%
27. Conversely, given the constraint on sequence conservation as a means of prediction, it is not entirely surprising to find that SNP density is, in fact, lower in these regions. It is satisfying that evidence for negative selection has also been observed in human-specific miRNAs; that is, those that do not show cross-species conservation
13. Although globally rare, mir-SNPs may be positively propagated, and so contribute to gene expression differences that become established in species. Such is the indication from miRNAs that are conserved in primates only
152 or that are human-specific
57, and there is evidence of local positive selection that may have contributed to expression diversity and adaptation, as well as higher-order thinking in humans. mir-SNPs may be another genetic modification that explains phenotypic differences in humans compared with other primates in the absence of high genotypic diversity. From the perspective of the molecular epidemiology of cancer, because mir-SNPs are so rare, their likelihood to be disruptive is higher.
SNPs in miRNA genes are thought to affect function in one of three ways: first, through the transcription of the primary transcript; second, through pri-miRNA and pre-miRNA processing; and third, through effects on miRNA–mRNA interactions (,). See
Supplementary information S1 (table) for a list of SNPs in all currently known pre-miRNAs and mature miRNAs, which has been created using build 130 of
dbSNP (see
Further information). Most studies that have followed a biologically based candidate gene approach to search for SNPs in miRNAs that might confer cancer susceptibility rely on knowledge of a functional link between a particular miRNA and gene target.
The first evidence that point mutations in miRNA genes can have a functional effect and confer cancer susceptibility comes from a seminal study by Carlo Croce’s group
32 in which a germline mutation in
pri-mir-16-1 was found in a kindred with familial chronic lymphocytic leukaemia (CLL) and resulted in low levels of
miR-16-1 expression (). Moreover, the mutation was subsequently discovered in New Zealand black mice that naturally develop CLL-like disease
33. However, this mutation was not detected in a large panel of other tumour types, thereby showing specificity for cancers of a particular origin
34. Notably, the oncogenic
mir-17–92 cluster contains two miRNAs that harbour three SNPs (see
Supplementary information S1 (table)), but the relevance to inherited cancer risk and carcinogenesis remains to be investigated.
To date, the literature suggests that the functional consequences of SNPs in pri-miRNAs relate to processing and levels of the mature miRNA. For example, SNPs in the pri regions of
let-7e and
mir-16 lead to decreased mature miRNA levels
29,32. Indeed, several studies show an association between pri-mir SNPs and cancer risk (). Specifically, rs7372209 in
mir-26a-1 was associated with a 64% decreased risk of bladder cancer in females
35, and a twofold increased risk of premalignant oral lesions
36. The rs531564 SNP in
mir-124-1 is associated with an increased risk of bladder cancer
35 and oesophageal cancer in males
37, and the pri-miR SNP rs213210 in
mir-219 increased the risk of oesophageal cancer
37. Another pri-miR SNP, rs2292932 in
mir-149, has been tested in several cancers but has not been associated with cancer risk
38,39. This suggests that the molecular mechanisms underlying the genetic associations of mir-SNPs with cancer are complex and vary by cancer site.
| Table 1SNPs in miRNAs and cancer risk |
Case–control studies have also provided evidence for an association of pre-mir SNPs and cancer risk. Rs11614913 in
pre-mir-196a-2
contributes to the risk of developing breast
40, lung
39 and gastric cancers
41 in the Chinese population (). In each case the rs11614913 variant homozygote CC was associated with increased cancer risk. Risk of developing oesophageal cancer in Caucasian males and never-smokers was significantly associated with the rs11614913 variant homozygote TT, the minor allele in this population
37. Rs11614913 is located in the 3′ passenger (3p) strand mature sequence of
mir-196a-2, thereby possibly affecting both maturation and the repertoire of target mRNAs with which it interacts. Indeed, previous studies have shown that sequence variations in mature and precursor miRNA sequences affect miRNA biogenesis
28,42, and levels of mature miR-196a-2 were lower in CC carriers than in TT carriers
39. Notably, this SNP has also been associated with poor survival in patients with lung cancer
43. Indeed, this was the first demonstration that miRNA-related SNPs could be related to cancer prognosis.
An SNP in the terminal loop of
pre-mir-27a, rs895819, confers a reduced risk of developing breast cancer in families with a history of non-BRCA-related disease
44 and in families with mutant
BRCA2 (REF.
45). It has been shown that artificial mutations in the terminal loop of miRNAs such as
mir-21 and
mir-30 can block miRNA maturation
46, so it is conceivable that the variant allele of rs895819 might impair the maturation of oncogenic
mir-27a, thus explaining the protective effect of the SNP. However, no alterations of free energy or conformation of the miR-27a–mRNA duplex were predicted
in silico, thereby leading to the assumption that the SNP would not affect
mir-27a maturation or targeting
45. Nevertheless, SNPs in pre-miRNAs may affect expression even in the absence of apparent effects on its secondary structure. Such is the case for an SNP in
let-7e (rs41275792), which leads to reduced levels of the mature miRNA
in vivo even though its secondary structure is not predicted to change
29. The location of the rs895819 SNP in the centre of the terminal loop of
mir-27a is likely to decrease the size of the loop and affect the binding of DROSHA, thus decreasing miRNA maturation
29,46. Alternatively, the SNP might influence the binding affinity of several DROSHA inhibitors, such as Lin28 (REFS
47,48).
Another pre-miRNA SNP, rs6505162 in
pre-mir-423, is associated with an increased risk of bladder cancer
35 and ovarian cancer in carriers of mutant
BRCA2 (REF.
45), and decreased risk of oesophageal cancer in Caucasians
37. There is no clear explanation for the opposing effects of this SNP in different cancer types within the same population. Modulations of mature levels of
miR-423 have not yet been functionally linked to this SNP, and the
RNAFold software (see
Further information) does not predict a change in secondary structure
45. An A/C SNP in
mir-30c-2 was predicted to cause the greatest change in target gene identification, and as such was postulated to affect cancer risk
31, although results from a study in hepatocellular carcinoma (HCC) did not support this hypothesis
49.
A unique example of a functional miRNA SNP is rs2910164, which is located in the 3p strand of
mir-146a (). This polymorphism involves a mispairing in the hairpin of the precursor, which leads to altered processing, lower expression of the mature sequence and predisposition to papillary thyroid carcinoma
50. Intriguingly, individuals with a heterozygous genotype have a greater risk of developing papillary thyroid carcinoma than homozygous individuals
50. Heterozygosity as a genetic risk factor is rare, and as postulated by the authors may be a form of genetic epistasis in which the phenotype of the heterozygote differs from the sum of the phenotypes of both alleles. Indeed, in a follow-up study, the same authors found that the SNP fell within the seed of the 3p strand, and would give rise to three mature miRNAs (one from the leading strand and two from the 3p strand) instead of the expected two observed in homozygous individuals
51. The resulting complexity is underscored by the finding that each mature miRNA will target a specific repertoire of mRNAs
51. Interestingly, there is evidence of homozygous to heterozygous somatic mutations at rs2910164 in several patients with papillary thyroid tumours
50. These seminal studies elegantly decipher how a small genetic change can influence the gene expression profile and show that somatic mutations in miRNAs can be an oncogenic event. In separate studies, rs2910164 was associated with an increased risk of developing prostate cancer
52 and HCC
53 in males through reduced expression levels of miR-146a
53. Although not directly associated with breast cancer risk, rs2910164 was associated with a younger age of breast cancer diagnosis in familial breast cancer after adjustment for
BRCA1 and
BRCA2 mutation status
54. The variant allele of rs2910164 leads to increased levels of mature miR-146a and binds with greater affinity to
BRCA1. Predisposition may therefore develop through the downregulation of
BRCA1 (REF.
54). Alternatively, rs2910146 could disrupt the well-documented role of miR-146a as a mediator of the pro-apoptotic transcription factor nuclear factor-κB (NF-κB)
55,56. In support of this possibility, genes involved in the regulation of apoptosis were differentially transcribed in heterozygote rs2910146 carriers
51.
Although studies have started to reveal the nature of the association between miRNA SNPs and cancer risk, several considerations remain: most of the studies used a candidate gene approach; of those that used a systematic approach, their lists are outdated owing to enhanced screening techniques that have identified new miRNA genes and updated builds of genome-wide SNP repositories. According to the most recent updates of miRNAs, 202 pre-miR genes have 283 SNPs (
Supplementary information S1 (table)). It is likely that as new builds of dbSNP are formulated more SNPs with potential relevance to cancer risk will be identified. In addition, the minor allele frequencies of many of the mir-SNPs already identified have not been determined. Therefore, population studies should be conducted that will ascertain whether or not these SNPs are polymorphic and if so in what populations. This is an important consideration, as data are emerging to suggest that some mir-SNPs have evolved to a high level of variance in distinct populations. For example, the variant alleles of several SNPs occur only in populations of African descent
57. These epidemiological associations need to be validated in independent populations and functionally tested
58. Furthermore, the inclusion of mir-SNPs in future genome-wide association studies (GWAs) will help to unveil low-penetrance susceptibility mutations. Most mir-SNPs are not included in current GWAS designs, and as such there is a paucity of information in this regard. The identification of tag sNPs for miRNA-related SNPs will also be a useful endeavour. Clarification of the extent of the pri region of miRNA genes is also needed to more accurately assess miRNA-related genetic variation. Many studies currently limit their analysis of mir-SNPs to the pre and mature regions, as these are clearly defined.