Neuroblastoma, one of the most common solid tumors in childhood, is characterized by diverse clinical phenotypes
1. While a substantial proportion of patients may show a favorable outcome and may even have spontaneous regression of a localized, or even disseminated, tumor
3,4, approximately 50% of cases show an aggressive clinical course with widespread metastases to bone and bone marrow at diagnosis
1. These latter children have survival rates of less than 35% despite aggressive therapy with dose-intensive induction chemotherapy and surgery, followed by myeloablative therapy with stem cell rescue, local radiation therapy and biological response modification using retinoids and/or immunotherapy
1,5,6.
Our recent GWAS demonstrated that three common SNP alleles within the predicted genes
FLJ22536 and
FLJ44180 at chromosome band 6p22 were associated with neuroblastoma
2. No other region of the genome contained SNPs that reached genome-wide significance and survived our replication effort. Of particular interest was the finding that not only were the three 6p22 SNPs associated with the likelihood of developing neuroblastoma, but patients who carried the 6p22 risk alleles were more likely to develop the clinically aggressive form of the disease and suffer tumor recurrence (dbGaP:
http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000124.v1.p1)
2. These data support the hypothesis that the benign and malignant forms of neuroblastoma may represent distinct entities in terms of the genetic events that initiate tumorigenesis.
We therefore performed a second genome-wide analysis, this time limiting the cases to those patients with high-risk neuroblastoma as defined by the Children’s Oncology Group (COG)
1. We identified 397 high-risk cases from the 1032 neuroblastoma patients included in the discovery set from our original study, and analyzed them against the same 2043 unaffected children from the discovery set control group. Quality control filters were applied to SNP genotype data as previously described
2, resulting in a total of 462,866 autosomal SNPs available for analysis (see
Supplementary Information). This analysis confirmed the three previously identified SNPs at chromosome band 6p22 (rs6939340, rs4712653, rs9295536) being significantly associated to high-risk neuroblastoma (;
Pallelic = 3×10
−11, 8×10
−11, 6×10
−10). These results in just 397 cases showed more highly significant
P-values than those observed in the analysis of all 1032 cases with the identical control group
2. In addition, we were able to identify a new association with common intronic SNPs at the
BARD1 (
BRCA1-associated RING domain-1) gene locus at chromosome band 2q35 (), with a total of six SNPs showing allelic test
P-values less than 1×10
−7 (, and
Supplementary Table 1). All
BARD1 SNP
P-values were minimally affected by correction for population stratification based on principal component analysis (
Supplementary Table 1 and
Supplementary Figure 1).
| Table 1Summary of discovery and replication results for six BARD1 SNPs with allelic P<10−7 in discovery case series. |
Associations for all SNPs with
P-values < 1×10
−7 in the discovery set from both the 2q35 and 6p22 regions showed replication in a second independent group of 189 high-risk cases and 1178 unaffected controls genotyped genome-wide ( for
BARD1 SNPs and
Pallelic = 0.010, 0.008, 0.005 for chromosome 6p22 SNPs rs6939340, rs4712653, rs9295536, respectively). In contrast, association with the
BARD1 SNPs was not significant in 575 low- and intermediate-risk patients from the discovery series (
P-values > 0.05), and just three of them reached nominal significance in the total 756 low- and intermediate-risk patients from the discovery and replication sets combined, the smallest allelic
P-value being 0.003 for rs3768716 (
Supplementary Table 2). We then tested the two most significantly associated
BARD1 SNPs (rs6435862 and rs3768716) for association with high-risk neuroblastoma, utilizing the cases within our previous two separate independent case series from the United Kingdom (UK) and the US-based legacy Children’s Cancer Group (CCG) with a high-risk phenotype.
2 Both
BARD1 SNPs were found significantly associated to high-risk neuroblastoma in the UK case series, whereas only rs3768716 showed a significant allelic
P-value in the smaller CCG case series (). Combining data from all four groups of cases and controls showed allelic odds-ratios for these two SNPs of 1.68 each, with
P-values of 9×10
−18 for rs6435862 and of 3×10
−16 for rs3768716 ().
The six genome-wide significant SNPs in the discovery phase are located in introns 1, 3 and 4 of
BARD1 (). Pairwise linkage disequilibrium (LD) analysis showed that these SNPs are in relatively strong LD with each other (r
2=0.47–0.96), but are not in LD with the non-significant SNPs elsewhere within
BARD1 (
Supplementary Fig. 2). Haplotype analysis on the six most significant SNPs in cases (586) and controls (3221) from CHOP discovery and replication sets combined revealed only seven haplotypes with frequency greater than 1% in both cases and controls, and only four with frequency greater than 2% (
Supplementary Table 3). The most frequent haplotype was composed of all major alleles, and the second most frequent of all minor alleles, in both cases and controls. These were the only haplotypes with different frequencies in cases and controls (0.50 in cases and 0.60 in controls the first; 0.31 in cases and 0.21 in controls the second). These results are consistent with the high correlation observed among all SNPs in this region. Logistic regression analysis performed in the same cases and controls showed that a model including only rs3768716 explained the observed association as well as a model including any additional SNPs. It is therefore possible that a single variant in high LD with these SNPs explains all of the observed association, but regional resequencing will likely be required to address this definitively.
BARD1 contains several known coding variants (
http://www.ncbi.nlm.nih.gov/SNP/snp_ref.cgi?locusId=580)
7–10, and it was possible that one or more of these may be disease causal. We therefore selected the three common non-synonymous SNPs (minor allele frequency [MAF] > 0.10) for genotyping; rs1048108 (P24S) in exon 1, rs2229571 (R378S) in exon 4, and rs2070094 (V507M) in exon 6 (). According to HapMap CEU data, these three coding SNPs are in strong LD with the neuroblastoma-associated intronic SNPs rs6744811 (r
2=0.86), rs7557557 (r
2=0.76), and rs7584646 (r
2=0.75). In addition, we also selected two rare non-synonymous SNPs for additional genotyping because of previously reported associations with breast cancer;
7,9,10 rs28997576 (C557S) in exon 7 and rs3738888 (R658C) in exon 10 (). Finally, we also genotyped the common SNP rs7585356 located 3′ downstream of
BARD1 because of it being in LD with a SNP rs16852600 (r
2=0.89) that was of borderline significance, and because of it being located in a highly conserved region of the genome (SNPseek;
http://snp.wustl.edu/cgi-bin/SNPseek/index.cgi). These six SNPs were assayed by TaqMan®-based genotyping in a case series consisting of 540 high-risk neuroblastoma cases from the discovery and replication set, and 1142 controls from the replication set.
Four of these SNPs showed statistically significant association with high-risk neuroblastoma and that these were in strong LD as there were only 6 observed haplotypes with an observed frequency > 2% (
Pallelic ranging from 1×10
−8 for rs2070094 to 3×10
−6 for rs2229571; ;
Supplementary Table 4). However, none of the nsSNPs was more strongly associated with high-risk neuroblastoma than the GWAS intronic SNPs rs3768716 and rs6435862, which yielded allelic ORs of 1.82 (rs3768716: 95% CI: 1.56–2.13, P = 5×10
−14; rs6435862: 95% CI: 1.57–2.11, P = 2×10
−15), when analyzed in a comparable group of 586 cases (genome-wide discovery and replication cases combined) and 1178 controls (genome-wide replication controls). SNP rs28997576 was not polymorphic in our dataset, and association with rs3738888, which had a very low minor allele frequency, was not significant.
| Table 2Results for BARD1 regulatory and coding SNPs genotyped in CHOP cases and controls |
To assess the joint impact on disease risk of the genetic variants at chromosome regions 2q35 and 6p22, we estimated the two-locus genotype odds-ratios for the two most significant SNPs from each region (rs6435862 and rs9295536) in the CHOP discovery and replication sets combined (). Each locus independently contributed to disease risk with odds-ratios for carriers of 1 or 2 risk alleles relatively to non-carriers of 1.52 (95% CI: 1.03–2.24, P = 0.037) for rs6435862, of 1.75 (95% CI: 1.26–2.45, P = 0.001) for rs9295536, and of 2.99 (95% CI: 2.17–4.11, P = 3×10−13) for the two SNPs together. No significant interaction between the two SNPs was detected (P = 0.6).
| Table 3Estimates of high-risk neuroblastoma risk (odds-ratios) by genotype (number of risk alleles) at BARD1 (rs6435862) and 6p22 (rs9295536) |
Taken together, these data strongly support
BARD1 as the second identified susceptibility locus to sporadically occurring neuroblastoma. Coupled with the recent discovery of highly penetrant mutations in the
ALK oncogene as the major cause of hereditary neuroblastoma,
11 the genetic basis of human neuroblastoma is now coming into focus. This report further confirms that common genomic variants are highly associated with this childhood cancer, and unlike our 6p22 discovery within putative transcripts of unknown function,
2 these data clearly implicate a known and well characterized gene. BARD1 heterodimerizes with the familial breast cancer gene product BRCA1,
12 and is considered to be essential for the latter genes known tumor suppressive function. Since pathogenic
BRCA1 mutations interfere with heterodimerization to BARD1, it has been postulated that
BARD1 may also function as a breast cancer susceptibility gene. However, while many studies have investigated the potential role of
BARD1 in breast cancer susceptibility, there is currently no compelling evidence that DNA sequence alterations influence breast cancer pathogenesis and the locus has not emerged in breast cancer GWAS efforts.
13–15 We now report the first definitive evidence that this gene is involved in cancer susceptibility. Ongoing studies are now focused on understanding the biological consequences of these SNP variations at the
BARD1 locus in the developing sympathetic neuroblast, and how these influence malignant transformation.
It is also important to emphasize that this work clearly demonstrates the power of having robust phenotypic information available in GWAS approaches to human disease. Because the 6p22 association was enriched in the more aggressive subset of neuroblastoma cases, we were able to focus a new discovery case series on this clinically important group of patients. It is clear that at a somatic level, a single cancer histology may represent multiple different genomic subsets. Our data suggest that genetic initiating events may predispose not only to cancer, but to a particular subphenotype of the disease, and thus to patient outcome. This may have implications for both screening and identifying critical pathways for targeted therapeutics.