Descriptions of the study populations in the combined resource are shown in . Cases from Utah had a higher proportion of first-degree relatives with CRC than cases in the UK cohorts (p
anova <0.001), and a higher proportion of early-onset cases (age 59 and younger), as would be expected for CRC cases selected from high-risk cancer pedigrees (p
anova=0.002). Utah also had a lower proportion of rectal cancer (p
anova <0.001). This is also as expected because the CRC high-risk pedigrees were ascertained primarily for excess of colon cancers. describes the twelve SNPs studied. All SNPs were in Hardy-Weinberg equilibrium. The number of cases and controls, of each SNP genotype, by individual study center is provided in
Supplemental Table 2 (online).
| Table 2Minor allele frequencies (MAFs) in case and control subjects |
The meta association results for each SNP with CRC are shown in . Based on Cochran’s Q test, there were no results that exhibited significant heterogeneity across studies. In 8q24, we observed two nominally significant results for SNPs rs6983267 (p
trend=0.01) and rs10090154 (p
trend=0.05), associated with modest per-allele increased risks. For rs6983267 the high-risk G allele was the major allele in our study population; however, allele T was considered the reference allele to be consistent with previous studies. SNP rs6983267 resides in region 3 of the 8q24 locus (
8). In our controls, the LD between rs6983267 and the other region 3 SNPs studied: rs10505477, rs10808556 and rs7013278, was r
2 of 0.94, 0.65, and 0.57 (|D′ of 0.99, 0.99, and 1.00), respectively. However, none of the other region 3 SNPs were significant. SNP rs10090154 resides in region 5 of 8q24 (
8). One other region 5 SNP with an r
2 of 0.89 (|D′| of 0.98) with rs10090154 was studied (rs1447295), but was not significantly associated with CRC (p=0.10). We inspected the study-specific results for rs6983267 and rs10090154. Even though there was no statistically significant evidence for heterogeneity across the three studies for these two SNPs (both p
hom>0.30), we observed that the association signal for both was primarily driven by the two UK cohorts (
Supplemental Table 3, online). Stratification by family history (
Supplemental Table 4, online), age at diagnosis, a combination of family history and age at diagnosis (
Supplemental Table 5, online), or tumor site, variables that differed between the UK and US sites, did not explain this observation. Potential sources of heterogeneity between the two UK sites and the US site include environmental factors, particularly alcohol consumption and smoking; and phenotypic heterogeneity, particularly that undiscovered high-risk alleles could exist in the Utah pedigrees. However, these tests for heterogeneity could not be directly assessed as the data were not available for this study.
| Table 3Association of candidate SNPs with CRC in meta analysis of Utah, Sheffield, and Leeds studies1 |
To investigate multi-locus associations at 8q24, we applied a haplotype-mining method (
18). Using data for all seven 8q24 SNPs, the method identified a 2-SNP haplotype (G-T) across rs6983267 and rs10090154 variants as the most significantly associated with CRC when compared to a referent haplotype of T-C (p=0.0004). It is also of interest that, in contrast to single-SNP analyses of rs6983267 and rs10090154, the two-locus model (carriage of G-T) association signal (OR
meta=2.04, 95%CI 1.42, 3.34) was more similar across the US and UK cohorts: OR
Sheffield=2.65 (1.34, 5.24); OR
Leeds=2.55 (0.98, 6.63) and OR
Utah=1.62 (0.80, 3.31). These two markers are not in LD (r
2<0.01 and |D′|=0.13) and are considered to reside in two separate 8q24 regions (
8). We simultaneously analyzed both SNPs, each modeled as log-additive. The significance for both SNPs, when included in the same model, was almost unchanged from their single SNP tests (): rs6983267 (p=0.02 compared to p=0.01 in the single SNP test); rs10090154 (p=0.065 compared to p=0.05 in the single SNP test). We further found no significant evidence for multiplicative interaction (p=0.14), although our power to detect interaction was lower than for main effects. Hence, although rs10090154 did not reach significance at the 0.05 level in the simultaneous analysis (p=0.065), the largely unchanged effects when rs6983267 and rs10090154 were considered together or separately, and the distinctly greater risk and significance of the two-locus model are consistent with the conclusions of others that these two SNPs may be independent risk loci for CRC (
6). Independent roles for SNPs in regions 3 and 5 have also been suggested for prostate cancer associations (
8).
There were no significant associations in 9p24 and 18q21 SNPs and overall risk of CRC. Associations at 9p24 did not differ when the resource was stratified by sex, age, family history, age and family history, or tumor site. Associations at 18q21 and overall risk of CRC were improved in the stratification by early-onset and family history (
Supplemental Table 5 online), and were best observed in the stratification by tumor site (). Significant associations were observed for distal colon tumors compared to controls for two SNPs, but not for proximal colon and rectal tumors (). In a distal CRC-control comparison, the minor C allele at rs4939827 was associated with a decreased risk of distal colon tumors (p
trend=0.007) and the minor T allele at rs12953717 with increased risk (p
trend=0.01). The LD between rs4939827 and rs12953717 is reasonably high (r
2=0.59, |D′|=0.92), suggesting redundancy of information between the SNPs that was confirmed in a simultaneous analysis of both SNPs. For rs4939827, there was a statistically significant difference between distal and other CRC sites in a case-case comparison (p=0.03). The finding for distal CRC was consistently observed across the UK and Utah study sites (
Supplemental Table 3). Findings were less significant in each study center considered alone, thus illustrating the benefit of increased power to detect associations from larger sample sizes inherent in a multi-study collaboration. Haplotype-mining did not identify any interesting two-locus associations or a three-locus association with distal CRC.
| Table 4Association of 18q21 SNPs and site-specific CRC in meta analysis of Utah, Sheffield, and Leeds studies1 |