Search tips
Search criteria

Results 1-21 (21)

Clipboard (0)

Select a Filter Below

Year of Publication
Document Types
1.  Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer 
Wang, Yufei | McKay, James D. | Rafnar, Thorunn | Wang, Zhaoming | Timofeeva, Maria | Broderick, Peter | Zong, Xuchen | Laplana, Marina | Wei, Yongyue | Han, Younghun | Lloyd, Amy | Delahaye-Sourdeix, Manon | Chubb, Daniel | Gaborieau, Valerie | Wheeler, William | Chatterjee, Nilanjan | Thorleifsson, Gudmar | Sulem, Patrick | Liu, Geoffrey | Kaaks, Rudolf | Henrion, Marc | Kinnersley, Ben | Vallée, Maxime | LeCalvez-Kelm, Florence | Stevens, Victoria L. | Gapstur, Susan M. | Chen, Wei V. | Zaridze, David | Szeszenia-Dabrowska, Neonilia | Lissowska, Jolanta | Rudnai, Peter | Fabianova, Eleonora | Mates, Dana | Bencko, Vladimir | Foretova, Lenka | Janout, Vladimir | Krokan, Hans E. | Gabrielsen, Maiken Elvestad | Skorpen, Frank | Vatten, Lars | Njølstad, Inger | Chen, Chu | Goodman, Gary | Benhamou, Simone | Vooder, Tonu | Valk, Kristjan | Nelis, Mari | Metspalu, Andres | Lener, Marcin | Lubiński, Jan | Johansson, Mattias | Vineis, Paolo | Agudo, Antonio | Clavel-Chapelon, Francoise | Bueno-de-Mesquita, H.Bas | Trichopoulos, Dimitrios | Khaw, Kay-Tee | Johansson, Mikael | Weiderpass, Elisabete | Tjønneland, Anne | Riboli, Elio | Lathrop, Mark | Scelo, Ghislaine | Albanes, Demetrius | Caporaso, Neil E. | Ye, Yuanqing | Gu, Jian | Wu, Xifeng | Spitz, Margaret R. | Dienemann, Hendrik | Rosenberger, Albert | Su, Li | Matakidou, Athena | Eisen, Timothy | Stefansson, Kari | Risch, Angela | Chanock, Stephen J. | Christiani, David C. | Hung, Rayjean J. | Brennan, Paul | Landi, Maria Teresa | Houlston, Richard S. | Amos, Christopher I.
Nature genetics  2014;46(7):736-741.
We conducted imputation to the 1000 Genomes Project of four genome-wide association studies of lung cancer in populations of European ancestry (11,348 cases and 15,861 controls) and genotyped an additional 10,246 cases and 38,295 controls for follow-up. We identified large-effect genome-wide associations for squamous lung cancer with the rare variants of BRCA2-K3326X (rs11571833; odds ratio [OR]=2.47, P=4.74×10−20) and of CHEK2-I157T (rs17879961; OR=0.38 P=1.27×10−13). We also showed an association between common variation at 3q28 (TP63; rs13314271; OR=1.13, P=7.22×10−10) and lung adenocarcinoma previously only reported in Asians. These findings provide further evidence for inherited genetic susceptibility to lung cancer and its biological basis. Additionally, our analysis demonstrates that imputation can identify rare disease-causing variants having substantive effects on cancer risk from pre-existing GWAS data.
PMCID: PMC4074058  PMID: 24880342
2.  A genome-wide association study of Hodgkin Lymphoma identifies new susceptibility loci at 2p16.1 (REL), 8q24.21, and 10p14 (GATA3) 
Nature genetics  2010;42(12):1126-1130.
To identify predisposition loci for classical Hodgkin Lymphoma (cHL) we conducted a genome-wide association study of 589 cHL cases and 5,199 controls with validation in 4 independent samples totaling 2,057 cases and 3,416 controls. We identified three new susceptibility loci at 2p16.1 (rs1432295, REL; odds ratio [OR]=1.22, Pcombined=1.91×10−8), 8q24.21 (rs2019960, PVT1; OR=1.33, Pcombined=1.26×10−13) and 10p14 (rs501764, GATA3; OR=1.25, Pcombined=7.05×10−8). Furthermore, we confirmed the role of the MHC in disease etiology by revealing a strong HLA association (rs6903608; OR=1.70, Pcombined=2.84×10−50). These data provide new insight into the pathogenesis of cHL.
PMCID: PMC4268499  PMID: 21037568
3.  Deciphering the genetic architecture of low-penetrance susceptibility to colorectal cancer 
Human Molecular Genetics  2013;22(24):5075-5082.
Recent genome-wide association studies (GWASs) have identified common variants at 16 autosomal regions influencing the risk of developing colorectal cancer (CRC). To decipher the genetic basis of the association signals at these loci, we performed a meta-analysis of data from five GWASs, totalling 5626 cases and 7817 controls, using imputation to recover un-typed genotypes. To enhance our ability to discover low-frequency risk variants, in addition to using 1000 Genomes Project data as a reference panel, we made use of high-coverage sequencing data on 253 individuals, 199 with early-onset familial CRC. For 13 of the regions, it was possible to refine the association signal identifying a smaller region of interest likely to harbour the functional variant. Our analysis did not provide evidence that any of the associations at the 16 loci being a consequence of synthetic associations rather than linkage disequilibrium with a common risk variant.
PMCID: PMC3836483  PMID: 23904454
4.  Deciphering the 8q24.21 association for glioma 
Human Molecular Genetics  2013;22(11):2293-2302.
We have previously identified tagSNPs at 8q24.21 influencing glioma risk. We have sought to fine-map the location of the functional basis of this association using data from four genome-wide association studies, comprising a total of 4147 glioma cases and 7435 controls. To improve marker density across the 700 kb region, we imputed genotypes using 1000 Genomes Project data and high-coverage sequencing data generated on 253 individuals. Analysis revealed an imputed low-frequency SNP rs55705857 (P = 2.24 × 10−38) which was sufficient to fully capture the 8q24.21 association. Analysis by glioma subtype showed the association with rs55705857 confined to non-glioblastoma multiforme (non-GBM) tumours (P = 1.07 × 10−67). Validation of the non-GBM association was shown in three additional datasets (625 non-GBM cases, 2412 controls; P = 1.41 × 10−28). In the pooled analysis, the odds ratio for low-grade glioma associated with rs55705857 was 4.3 (P = 2.31 × 10−94). rs55705857 maps to a highly evolutionarily conserved sequence within the long non-coding RNA CCDC26 raising the possibility of direct functionality. These data provide additional insights into the aetiological basis of glioma development.
PMCID: PMC3652416  PMID: 23399484
5.  Common variation at 2q22.3 (ZEB2) influences the risk of renal cancer 
Human Molecular Genetics  2012;22(4):825-831.
Genome-wide association studies (GWASs) of renal cell cancer (RCC) have identified four susceptibility loci thus far. To identify an additional RCC common susceptibility locus, we conducted a GWAS and performed a meta-analysis with published GWASs (totalling 2215 cases and 8566 controls of European background) and followed up the most significant association signals [nine single nucleotide polymorphisms (SNPs) in eight genomic regions] in 3739 cases and 8786 controls. A combined analysis identified a novel susceptibility locus mapping to 2q22.3 marked by rs12105918 (P = 1.80 × 10−8; odds ratio 1.29, 95% CI: 1.18–1.41). The signal localizes to intron 2 of the ZEB2 gene (zinc finger E box-binding homeobox 2). Our findings suggest that genetic variation in ZEB2 influences the risk of RCC. This finding provides further insights into the genetic and biological basis of inherited genetic susceptibility to RCC.
PMCID: PMC3554205  PMID: 23184150
6.  Germline mutations in the proof-reading domains of POLE and POLD1 predispose to colorectal adenomas and carcinomas 
Nature genetics  2012;45(2):136-144.
Many individuals with multiple or large colorectal adenomas, or early-onset colorectal cancer (CRC), have no detectable germline mutations in the known cancer predisposition genes. Using whole-genome sequencing, supplemented by linkage and association analysis, we identified specific heterozygous POLE or POLD1 germline variants in several multiple adenoma and/or CRC cases, but in no controls. The susceptibility variants appear to have high penetrance. POLD1 is also associated with endometrial cancer predisposition. The mutations map to equivalent sites in the proof-reading (exonuclease) domain of DNA polymerases ε and δ, and are predicted to impair correction of mispaired bases inserted during DNA replication. In agreement with this prediction, mutation carriers’ tumours were microsatellite-stable, but tended to acquire base substitution mutations, as confirmed by yeast functional assays. Further analysis of published data showed that the recently-described group of hypermutant, microsatellite-stable CRCs is likely to be caused by somatic POLE exonuclease domain mutations.
PMCID: PMC3785128  PMID: 23263490
8.  Inherited variation at chromosome 12p13.33 including RAD52 influences squamous cell lung carcinoma risk 
Cancer Discovery  2011;2(2):131-139.
While lung cancer is largely caused by tobacco smoking, inherited genetic factors play a role in its etiology. Genome-wide association studies (GWAS) in Europeans have robustly demonstrated only three polymorphic variations influencing lung cancer risk. Tumor heterogeneity may have hampered the detection of association signal when all lung cancer subtypes were analyzed together. In a GWAS of 5,355 European smoking lung cancer cases and 4,344 smoking controls, we conducted a pathway-based analysis in lung cancer histologic subtypes with 19,082 SNPs mapping to 917 genes in the HuGE-defined “inflammation” pathway. We identified a susceptibility locus for squamous cell lung carcinoma (SQ) at 12p13.33 (RAD52, rs6489769), and replicated the association in three independent samples totaling 3,359 SQ cases and 9,100 controls (odds ratio=1.20, Pcombined=2.3×10−8).
The combination of pathway-based approaches and information on disease specific subtypes can improve the identification of cancer susceptibility loci in heterogeneous diseases.
PMCID: PMC3354721  PMID: 22585858
Lung cancer; histology; squamous cell carcinoma; pathway analysis; RAD52
9.  Fine-mapping of colorectal cancer susceptibility loci at 8q23.3, 16q22.1 and 19q13.11: refinement of association signals and use of in silico analysis to suggest functional variation and unexpected candidate target genes 
Human Molecular Genetics  2011;20(14):2879-2888.
We have previously identified several colorectal cancer (CRC)-associated polymorphisms using genome-wide association (GWA) analysis. We sought to fine-map the location of the functional variants for three of these regions at 8q23.3 (EIF3H), 16q22.1 (CDH1/CDH3) and 19q13.11 (RHPN2). We genotyped two case–control sets at high density in the selected regions and used existing data from four other case–control sets, comprising a total of 9328 CRC cases and 10 480 controls. To improve marker density, we imputed genotypes from the 1000 Genomes Project and Hapmap3 data sets. All three regions contained smaller areas in which a cluster of single nucleotide polymorphisms (SNPs) showed clearly stronger association signals than surrounding SNPs, allowing us to assign those areas as the most likely location of the disease-associated functional variant. Further fine-mapping within those areas was generally unhelpful in identifying the functional variation based on strengths of association. However, functional annotation suggested a relatively small number of functional SNPs, including some with potential regulatory function at 8q23.3 and 16q22.1 and a non-synonymous SNP in RPHN2. Interestingly, the expression quantitative trait locus browser showed a number of highly associated SNP alleles correlated with mRNA expression levels not of EIF3H and CDH1 or CDH3, but of UTP23 and ZFP90, respectively. In contrast, none of the top SNPs within these regions was associated with transcript levels at EIF3H, CDH1 or CDH3. Our post-GWA study highlights benefits of fine-mapping of common disease variants in combination with publicly available data sets. In addition, caution should be exercised when assigning functionality to candidate genes in regions discovered through GWA analysis.
PMCID: PMC3118761  PMID: 21531788
10.  Genetic Variants at Chromosomes 2q35, 5p12, 6q25.1, 10q26.13, and 16q12.1 Influence the Risk of Breast Cancer in Men 
PLoS Genetics  2011;7(9):e1002290.
Male breast cancer accounts for approximately 1% of all breast cancer. To date, risk factors for male breast cancer are poorly defined, but certain risk factors and genetic features appear common to both male and female breast cancer. Genome-wide association studies (GWAS) have recently identified common single nucleotide polymorphisms (SNPs) that influence female breast cancer risk; 12 of these have been independently replicated. To examine if these variants contribute to male breast cancer risk, we genotyped 433 male breast cancer cases and 1,569 controls. Five SNPs showed a statistically significant association with male breast cancer: rs13387042 (2q35) (odds ratio (OR)  = 1.30, p = 7.98×10−4), rs10941679 (5p12) (OR = 1.26, p = 0.007), rs9383938 (6q25.1) (OR = 1.39, p = 0.004), rs2981579 (FGFR2) (OR = 1.18, p = 0.03), and rs3803662 (TOX3) (OR = 1.48, p = 4.04×10−6). Comparing the ORs for male breast cancer with the published ORs for female breast cancer, three SNPs—rs13387042 (2q35), rs3803662 (TOX3), and rs6504950 (COX11)—showed significant differences in ORs (p<0.05) between sexes. Breast cancer is a heterogeneous disease; the relative risks associated with loci identified to date show subtype and, based on these data, gender specificity. Additional studies of well-defined patient subgroups could provide further insight into the biological basis of breast cancer development.
Author Summary
Breast cancer is the most common female cancer in the United Kingdom but also occurs in men, albeit at a much lower frequency. Relatively little is known regarding risk factors for male breast cancer. Here, we examine the effect of common genetic variants that are known to be associated with female breast cancer to determine whether they also affect risk of male breast cancer. We show that certain of these variants are also associated with male breast cancer risk but that the magnitudes of their effects differ in males from females. Future analyses of the genetics of male breast cancer may shed light on the biology of both male and female breast cancer.
PMCID: PMC3174231  PMID: 21949660
11.  Multiple Common Susceptibility Variants near BMP Pathway Loci GREM1, BMP4, and BMP2 Explain Part of the Missing Heritability of Colorectal Cancer 
PLoS Genetics  2011;7(6):e1002105.
Genome-wide association studies (GWAS) have identified 14 tagging single nucleotide polymorphisms (tagSNPs) that are associated with the risk of colorectal cancer (CRC), and several of these tagSNPs are near bone morphogenetic protein (BMP) pathway loci. The penalty of multiple testing implicit in GWAS increases the attraction of complementary approaches for disease gene discovery, including candidate gene- or pathway-based analyses. The strongest candidate loci for additional predisposition SNPs are arguably those already known both to have functional relevance and to be involved in disease risk. To investigate this proposition, we searched for novel CRC susceptibility variants close to the BMP pathway genes GREM1 (15q13.3), BMP4 (14q22.2), and BMP2 (20p12.3) using sample sets totalling 24,910 CRC cases and 26,275 controls. We identified new, independent CRC predisposition SNPs close to BMP4 (rs1957636, P = 3.93×10−10) and BMP2 (rs4813802, P = 4.65×10−11). Near GREM1, we found using fine-mapping that the previously-identified association between tagSNP rs4779584 and CRC actually resulted from two independent signals represented by rs16969681 (P = 5.33×10−8) and rs11632715 (P = 2.30×10−10). As low-penetrance predisposition variants become harder to identify—owing to small effect sizes and/or low risk allele frequencies—approaches based on informed candidate gene selection may become increasingly attractive. Our data emphasise that genetic fine-mapping studies can deconvolute associations that have arisen owing to independent correlation of a tagSNP with more than one functional SNP, thus explaining some of the apparently missing heritability of common diseases.
Author Summary
Genome-wide association studies (GWAS) have identified several colorectal cancer (CRC) susceptibility polymorphisms near genes that encode proteins in the bone morphogenetic protein (BMP) pathway. However, most of the inherited susceptibility to CRC remains unexplained. We investigated three of the best candidate BMP genes (GREM1, BMP4, and BMP2) for additional polymorphisms associated with CRC. By extensive validation of polymorphisms with only modest evidence of association in the initial phases of the GWAS, we identified new, independent CRC predisposition polymorphisms close to BMP4 (rs1957636) and BMP2 (rs4813802). Near GREM1, we used additional genotyping around the GWAS-identified polymorphism rs4779584 to demonstrate two independent signals represented by rs16969681 and rs11632715. Common genes with modest effects on disease risk are becoming harder to identify, and approaches based on informed candidate gene selection may become increasingly attractive. In addition, genetic fine mapping around polymorphisms identified in GWAS can deconvolute associations which have arisen owing to two independent functional variants. These types of study can identify some of the apparently missing heritability of common disease.
PMCID: PMC3107194  PMID: 21655089
12.  Chromosome 15q25 (CHRNA3-CHRNA5) Variation Impacts Indirectly on Lung Cancer Risk 
PLoS ONE  2011;6(4):e19085.
Genetic variants at the 15q25 CHRNA5-CHRNA3 locus have been shown to influence lung cancer risk however there is controversy as to whether variants have a direct carcinogenic effect on lung cancer risk or impact indirectly through smoking behavior. We have performed a detailed analysis of the 15q25 risk variants rs12914385 and rs8042374 with smoking behavior and lung cancer risk in 4,343 lung cancer cases and 1,479 controls from the Genetic Lung Cancer Predisposition Study (GELCAPS). A strong association between rs12914385 and rs8042374, and lung cancer risk was shown, odds ratios (OR) were 1.44, (95% confidence interval (CI): 1.29–1.62, P = 3.69×10−10) and 1.35 (95% CI: 1.18–1.55, P = 9.99×10−6) respectively. Each copy of risk alleles at rs12914385 and rs8042374 was associated with increased cigarette consumption of 1.0 and 0.9 cigarettes per day (CPD) (P = 5.18×10−5 and P = 5.65×10−3). These genetically determined modest differences in smoking behavior can be shown to be sufficient to account for the 15q25 association with lung cancer risk. To further verify the indirect effect of 15q25 on the risk, we restricted our analysis of lung cancer risk to never-smokers and conducted a meta-analysis of previously published studies of lung cancer risk in never-smokers. Never-smoker studies published in English were ascertained from PubMed stipulating - lung cancer, risk, genome-wide association, candidate genes. Our study and five previously published studies provided data on 2,405 never-smoker lung cancer cases and 7,622 controls. In the pooled analysis no association has been found between the 15q25 variation and lung cancer risk (OR = 1.09, 95% CI: 0.94–1.28). This study affirms the 15q25 association with smoking and is consistent with an indirect link between genotype and lung cancer risk.
PMCID: PMC3084737  PMID: 21559498
13.  Allelic Variation at the 8q23.3 Colorectal Cancer Risk Locus Functions as a Cis-Acting Regulator of EIF3H 
PLoS Genetics  2010;6(9):e1001126.
Common genetic variation at human 8q23.3 is significantly associated with colorectal cancer (CRC) risk. To elucidate the basis of this association we compared the frequency of common variants at 8q23.3 in 1,964 CRC cases and 2,081 healthy controls. Reporter gene studies showed that the single nucleotide polymorphism rs16888589 acts as an allele-specific transcriptional repressor. Chromosome conformation capture (3C) analysis demonstrated that the genomic region harboring rs16888589 interacts with the promoter of gene for eukaryotic translation initiation factor 3, subunit H (EIF3H). We show that increased expression of EIF3H gene increases CRC growth and invasiveness thereby providing a biological mechanism for the 8q23.3 association. These data provide evidence for a functional basis for the non-coding risk variant rs16888589 at 8q23.3 and provides novel insight into the etiological basis of CRC.
Author Summary
Common inherited variation on human chromosome 8q23 influences the risk of developing colorectal cancer (CRC). To understand the basis of this association we have compared the frequency of common genetic variants at 8q23 in ∼2,000 CRC cases and ∼2,000 healthy controls. Functional analyses of variants strongly associated with CRC risk showed that the single nucleotide polymorphism rs16888589 underscores the 8q23.3 association. The region of the genome harboring rs16888589 increases the expression of the gene for eukaryotic translation initiation factor 3, subunit H. We show that increased expression of this gene increases CRC growth thereby providing a biological mechanism for the 8q23.3 association. This finding is of particular importance in elucidating the etiological basis of CRC.
PMCID: PMC2940760  PMID: 20862326
14.  Multiple Independent Loci at Chromosome 15q25.1 Affect Smoking Quantity: a Meta-Analysis and Comparison with Lung Cancer and COPD 
PLoS Genetics  2010;6(8):e1001053.
Recently, genetic association findings for nicotine dependence, smoking behavior, and smoking-related diseases converged to implicate the chromosome 15q25.1 region, which includes the CHRNA5-CHRNA3-CHRNB4 cholinergic nicotinic receptor subunit genes. In particular, association with the nonsynonymous CHRNA5 SNP rs16969968 and correlates has been replicated in several independent studies. Extensive genotyping of this region has suggested additional statistically distinct signals for nicotine dependence, tagged by rs578776 and rs588765. One goal of the Consortium for the Genetic Analysis of Smoking Phenotypes (CGASP) is to elucidate the associations among these markers and dichotomous smoking quantity (heavy versus light smoking), lung cancer, and chronic obstructive pulmonary disease (COPD). We performed a meta-analysis across 34 datasets of European-ancestry subjects, including 38,617 smokers who were assessed for cigarettes-per-day, 7,700 lung cancer cases and 5,914 lung-cancer-free controls (all smokers), and 2,614 COPD cases and 3,568 COPD-free controls (all smokers). We demonstrate statistically independent associations of rs16969968 and rs588765 with smoking (mutually adjusted p-values<10−35 and <10−8 respectively). Because the risk alleles at these loci are negatively correlated, their association with smoking is stronger in the joint model than when each SNP is analyzed alone. Rs578776 also demonstrates association with smoking after adjustment for rs16969968 (p<10−6). In models adjusting for cigarettes-per-day, we confirm the association between rs16969968 and lung cancer (p<10−20) and observe a nominally significant association with COPD (p = 0.01); the other loci are not significantly associated with either lung cancer or COPD after adjusting for rs16969968. This study provides strong evidence that multiple statistically distinct loci in this region affect smoking behavior. This study is also the first report of association between rs588765 (and correlates) and smoking that achieves genome-wide significance; these SNPs have previously been associated with mRNA levels of CHRNA5 in brain and lung tissue.
Author Summary
Nicotine binds to cholinergic nicotinic receptors, which are composed of a variety of subunits. Genetic studies for smoking behavior and smoking-related diseases have implicated a genomic region that encodes the alpha5, alpha3, and beta4 subunits. We examined genetic data across this region for over 38,000 smokers, a subset of which had been assessed for lung cancer or chronic obstructive pulmonary disease. We demonstrate strong evidence that there are at least two statistically independent loci in this region that affect risk for heavy smoking. One of these loci represents a change in the protein structure of the alpha5 subunit. This work is also the first to report strong evidence of association between smoking and a group of genetic variants that are of biological interest because of their links to expression of the alpha5 cholinergic nicotinic receptor subunit gene. These advances in understanding the genetic influences on smoking behavior are important because of the profound public health burdens caused by smoking and nicotine addiction.
PMCID: PMC2916847  PMID: 20700436
15.  Deciphering the impact of common genetic variation on lung cancer risk: A genome-wide association study 
Cancer research  2009;69(16):6633-6641.
To explore the impact of common variation on the risk of developing lung cancer we conducted a two-phase genome-wide association (GWA) study. In Phase 1, we compared the genotypes of 511,919 tagging single nucleotide polymorphisms (tagSNPs) in 1,952 cases and 1,438 controls; in Phase 2, 30,568 SNPs were genotyped in 2,465 cases and 3,005 controls. SNP selection was based on best supported P-values from Phase 1 and two other GWA studies of lung cancer. In the combined analysis of Phases 1 and 2, the strongest associations identified were defined by SNPs mapping to 15q25.1 (rs12914385; P = 3.19 × 10−16), 5p15.33 (rs4975616; P = 6.66 × 10−7), and 6p21.33 (rs3117582; P = 9.13 × 10−7). Variation at 15q25.1, but not 5p15.33 or 6p21.33, was strongly associated with smoking behaviour with risk alleles correlated to higher consumption. Variation at 5p15.33 was shown to significantly influence induction of lung cancer histology. Pooling data from the four series provided 21,620 genotypes for 7,560 cases and 8,205 controls. A meta-analysis provided increased support that variation at 15q25.1 (rs8034191; P = 3.24 × 10−26), 5p15.33 (rs4975616; P = 2.99 × 10−9), and 6p21.33 (rs3117582; P = 4.46 × 10−10) influences lung cancer risk. The next best-supported associations were attained at 15q15.2 (rs748404: P = 1.08 × 10−6) and 10q23.31 (rs1926203; P = 1.28 × 10−6). These data indicate few common variants account for 1% of the excess familial risk underscoring the necessity of having additional large sample series for gene discovery.
PMCID: PMC2754318  PMID: 19654303
lung cancer; genome-wide association
16.  Common 5p15.33 and 6p21.33 variants influence lung cancer risk 
Nature genetics  2008;40(12):1407-1409.
We conducted a genome-wide association (GWA) study of lung cancer comparing 511,919 SNP genotypes in 1,952 cases and 1,438 controls. The most significant association was attained at 15q25.1 (rs8042374; P = 7.75 × 10−12), confirming recent observations. Pooling data with two other GWA studies (5,095 cases, 5,200 controls) and with replication in an additional 2,484 cases and 3,036 controls, we identified two newly associated risk loci mapping to 6p21.33 (rs3117582, BAT3-MSH5; Pcombined = 4.97 × 10−10) and 5p15.33 (rs401681, CLPTM1L; Pcombined = 7.90 × 10−9).
PMCID: PMC2695928  PMID: 18978787
17.  Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21 
Nature genetics  2008;40(5):631-637.
In a genome-wide association study to identify loci associated with colorectal cancer (CRC) risk, we genotyped 555,510 SNPs in 1,012 early-onset Scottish CRC cases and 1,012 controls (phase 1.) In phase 2, we genotyped the 15,008 highest-ranked SNPs in 2,057 Scottish cases and 2,111 controls. We then genotyped the five highest-ranked SNPs from the joint phase 1 and 2 analysis in 14,500 cases and 13,294 controls from seven populations, and identified a previously unreported association, rs3802842 on 11q23 (OR = 1.1; P = 5.8 × 10-10), showing population differences in risk. We also replicated and fine-mapped associations at 8q24 (rs7014346; OR = 1.19; P = 8.6 × 10-26) and 18q21 (rs4939827; OR = 1.2; P = 7.8 × 10-28). Risk was greater for rectal than for colon cancer for rs3802842 (P < 0.008) and rs4939827 (P < 0.009). Carrying all six possible risk alleles yielded OR = 2.6 (95% CI = 1.75-3.89) for CRC. These findings extend our understanding of the role of common genetic variation in CRC etiology.
PMCID: PMC2778004  PMID: 18372901
18.  A genome-wide scan of 10 000 gene-centric variants and colorectal cancer risk 
European Journal of Human Genetics  2009;17(11):1507-1514.
Genome scans based on gene-centric single nucleotide polymorphisms (SNPs) have been proposed as an efficient approach to identify disease-causing variants that is complementary to scans based on tagging SNPs. Adopting this approach to identify low-penetrance susceptibility alleles for colorectal cancer (CRC) we analysed genotype data from 9109 gene-centric SNPs, 7014 of which were non-synonymous (nsSNPs), in 2873 cases and 2871 controls using Illumina iselect arrays. Overall the distribution of associations was not significantly different from the null. No SNP achieved globally significant association after correction for multiple testing (lowest P value 1.7 × 10−4, rs727299). We then analysed the dataset incorporating information on the functional consequences of nsSNPs. We used results from the in silico algorithm PolyPhen as prior information to weight the association statistics, with weights estimated from the observed test statistics within predefined groups of SNPs. Incorporating this information did not, however, yield any further evidence of a specific association (lowest P value 2.2 × 10−4, rs1133950). There was a strong relationship between effect size and SNPs predicted to be damaging (P=1.63 × 10−5), however, these variants which are most likely to impact on risk are rare (MAF<5%). Hence although the rationale for searching for low-penetrance cancer susceptibly alleles by conducting genome-wide scans of coding changes is strong, in practice it is likely that natural selection has rendered such alleles to be too rare to be detected by association studies of the size employed.
PMCID: PMC2986682  PMID: 19471308
polymorphism; cancer; risk
19.  Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1 
Nature genetics  2008;40(5):616-622.
To identify risk variants for lung cancer, we conducted a multistage genome-wide association study. In the discovery phase, we analyzed 315,450 tagging SNPs in 1,154 current and former (ever) smoking cases of European ancestry and 1,137 frequency-matched, ever-smoking controls from Houston, Texas. For replication, we evaluated the ten SNPs most significantly associated with lung cancer in an additional 711 cases and 632 controls from Texas and 2,013 cases and 3,062 controls from the UK. Two SNPs, rs1051730 and rs8034191, mapping to a region of strong linkage disequilibrium within 15q25.1 containing PSMA4 and the nicotinic acetylcholine receptor subunit genes CHRNA3 and CHRNA5, were significantly associated with risk in both replication sets. Combined analysis yielded odds ratios of 1.32 (P < 1 × 10−17) for both SNPs. Haplotype analysis was consistent with there being a single risk variant in this region. We conclude that variation in a region of 15q25.1 containing nicotinic acetylcholine receptors genes contributes to lung cancer risk.
PMCID: PMC2713680  PMID: 18385676
20.  Evaluation of NTHL1, NEIL1, NEIL2, MPG, TDG, UNG and SMUG1 genes in familial colorectal cancer predisposition 
BMC Cancer  2006;6:243.
The observation that germline mutations in the oxidative DNA damage repair gene MUTYH cause colorectal cancer (CRC) provides strong evidence that dysregulation of the base excision repair (BER) pathway influences disease susceptibility. It is conceivable that germline sequence variation in other BER pathway genes such as NTHL1, NEIL1, NEIL2, MPG, TDG, UNG and SMUG1 also contribute to CRC susceptibility.
To evaluate whether sequence variants of NTHL1, NEIL1, NEIL2, MPG, TDG, UNG and SMUG1 genes might act as CRC susceptibility alleles, we screened the coding sequence and intron-exon boundaries of these genes in 94 familial CRC cases in which involvement of known genes had been excluded.
Three novel missense variants were identified NEIL2 C367A, TDG3 A196G and UNG2 C262T in patients, which were not observed in 188 healthy control DNAs.
We detected novel germline alterations in NEIL2, TDG and UNG patients with CRC. The results suggest a limited role for NTHL1, NEIL1, NEIL2, MPG, TDG, UNG and SMUG1 in development of CRC.
PMCID: PMC1624846  PMID: 17029639
21.  MiMiR: a comprehensive solution for storage, annotation and exchange of microarray data 
BMC Bioinformatics  2005;6:268.
The generation of large amounts of microarray data presents challenges for data collection, annotation, exchange and analysis. Although there are now widely accepted formats, minimum standards for data content and ontologies for microarray data, only a few groups are using them together to build and populate large-scale databases. Structured environments for data management are crucial for making full use of these data.
The MiMiR database provides a comprehensive infrastructure for microarray data annotation, storage and exchange and is based on the MAGE format. MiMiR is MIAME-supportive, customised for use with data generated on the Affymetrix platform and includes a tool for data annotation using ontologies. Detailed information on the experiment, methods, reagents and signal intensity data can be captured in a systematic format. Reports screens permit the user to query the database, to view annotation on individual experiments and provide summary statistics. MiMiR has tools for automatic upload of the data from the microarray scanner and export to databases using MAGE-ML.
MiMiR facilitates microarray data management, annotation and exchange, in line with international guidelines. The database is valuable for underpinning research activities and promotes a systematic approach to data handling. Copies of MiMiR are freely available to academic groups under licence.
PMCID: PMC1299320  PMID: 16280078

Results 1-21 (21)