DNA repair genes are important for maintaining genomic stability and limiting carcinogenesis. We analyzed all single nucleotide polymorphisms (SNPs) of 125 DNA repair genes covered by the Illumina HumanHap300 (v1.1) BeadChips in a previously conducted genome-wide association study (GWAS) of 1,154 lung cancer cases and 1,137 controls and replicated the top-hits of XRCC4 SNPs in an independent set of 597 cases and 611 controls in Texas populations. We found that six of 20 XRCC4 SNPs were associated with a decreased risk of lung cancer with a P value of 0.01 or lower in the discovery dataset, of which the most significant SNP was rs10040363 (P for allelic test = 4.89 ×10−4). Moreover, the data in this region allowed us to impute a potentially functional SNP rs2075685 (imputed P for allelic test = 1.3 ×10−3). A luciferase reporter assay demonstrated that the rs2075685G>T change in the XRCC4 promoter increased expression of the gene. In the replication study of rs10040363, rs1478486, rs9293329, and rs2075685, however, only rs10040363 achieved a borderline association with a decreased risk of lung cancer in a dominant model (adjusted OR = 0.80, 95% CI = 0.62–1.03, P = 0.079). In the final combined analysis of both the Texas GWAS discovery and replication datasets, the strength of the association was increased for rs10040363 (adjusted OR = 0.77, 95% CI = 0.66–0.89, Pdominant = 5×10−4 and P for trend = 5×10−4) and rs1478486 (adjusted OR = 0.82, 95% CI = 0.71 −0.94, Pdominant = 6×10−3 and P for trend = 3.5×10−3). Finally, we conducted a meta-analysis of these XRCC4 SNPs with available data from published GWA studies of lung cancer with a total of 12,312 cases and 47,921 controls, in which none of these XRCC4 SNPs was associated with lung cancer risk. It appeared that rs2075685, although associated with increased expression of a reporter gene and lung cancer risk in the Texas populations, did not have an effect on lung cancer risk in other populations. This study underscores the importance of replication using published data in larger populations.
XRCC4; variant; Genetic susceptibility; genome-wide association study; replication study
To identify risk variants for lung cancer, we conducted a multistage genome-wide association study. In the discovery phase, we analyzed 315,450 tagging SNPs in 1,154 current and former (ever) smoking cases of European ancestry and 1,137 frequency-matched, ever-smoking controls from Houston, Texas. For replication, we evaluated the ten SNPs most significantly associated with lung cancer in an additional 711 cases and 632 controls from Texas and 2,013 cases and 3,062 controls from the UK. Two SNPs, rs1051730 and rs8034191, mapping to a region of strong linkage disequilibrium within 15q25.1 containing PSMA4 and the nicotinic acetylcholine receptor subunit genes CHRNA3 and CHRNA5, were significantly associated with risk in both replication sets. Combined analysis yielded odds ratios of 1.32 (P < 1 × 10−17) for both SNPs. Haplotype analysis was consistent with there being a single risk variant in this region. We conclude that variation in a region of 15q25.1 containing nicotinic acetylcholine receptors genes contributes to lung cancer risk.
Published genome-wide association studies (GWASs) have identified few variants in the known biological pathways involved in lung cancer etiology. To mine the possibly hidden causal single nucleotide polymorphisms (SNPs), we explored all SNPs in the extrinsic apoptosis pathway from our published GWAS dataset for 1154 lung cancer cases and 1137 cancer-free controls. In an initial association analysis of 611 tagSNPs in 41 apoptosis-related genes, we identified only 10 tagSNPs associated with lung cancer risk with a P value <10−2, including four tagSNPs in DAPK1 and three tagSNPs in TNFSF8. Unlike DAPK1 SNPs, TNFSF8 rs2181033 tagged other four predicted functional but untyped SNPs (rs776576, rs776577, rs31813148 and rs2075533) in the promoter region. Therefore, we further tested binding affinity of these four SNPs by performing the electrophoretic mobility shift assay. We found that only rs2075533T allele modified levels of nuclear proteins bound to DNA, leading to significantly decreased expression of luciferase reporter constructs by 5- to –10-fold in H1299, HeLa and HCT116 cell lines compared with the C allele. We also performed a replication study of the untyped rs2075533 in an independent Texas population but did not confirm the protective effect. We further performed a mini meta-analysis for SNPs of TNFSF8 obtained from other four published lung cancer GWASs with 12 214 cases and 47 721 controls, and we found that only rs3181366 (r2 = 0.69 with the untyped rs2075533) was associated to lung cancer risk (P = 0.008). Our findings suggest a possible role of novel TNFSF8 variants in susceptibility to lung cancer.
Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene–environment interactions. To determine gene–asbestos interactions in lung cancer risk, we conducted genome-wide gene–environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10–6, which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10–5). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene–asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.
Abbreviations:CIconfidence intervalEenvironmentFDRfalse discovery rateGgeneGSEAgene-set-enrichment analysisGWASgenome-wide association studiesi-GSEAimproved gene-set-enrichment analysis approachORodds ratioSNPsingle nucleotide polymorphism
We adopted a two-stage study design to screen 927 single nucleotide polymorphisms (SNPs) located in 73 apoptotic-pathway genes in a case-control study and then performed a fast-track validation of the significant SNPs in a replication population to identify sequence variations in the apoptotic pathway modulating lung cancer risk. Fifty-five SNPs showed significant associations in the discovery population comprised of 661 lung cancer cases and 959 controls. Six of these SNPs located in three genes (Bcl-2, CASP9 and ANKS1B) were validated in a replication population with 1154 cases and 1373 controls. Additive model was the best-fitting model for five SNPs (rs1462129 and rs255102 of Bcl-2, rs6685648 of CASP9 and rs1549102, rs11110099 of ANKS1B) and recessive model was the best fit for one SNP (rs10745877 of ANKS1B). In the analysis of joint effects with subjects carrying no unfavorable genotypes as the reference group, those carrying one, two, and three or more unfavorable genotypes had an odds ratio (OR) of 2.22 [95% confidence interval (CI) = 1.08–4.57, P = 0.03], 2.70 (95% CI = 1.33–5.49; P = 0.006) and 4.13 (95% CI = 2.00–8.57; P = 0.0001), respectively (P for trend = 6.05E-06). The joint effect of unfavorable genotypes was also validated in the replication population. The SNPs identified are located in or near key genes known to play important roles in apoptosis regulation, supporting the strong biological relevance of our findings. Future studies are needed to identify the causal SNPs and elucidate the underlying molecular mechanisms.
Despite the central role of estrogen exposure in breast and endometrial cancer development and numerous studies of genes in the estrogen metabolic pathway, polymorphisms within the pathway have not been consistently associated with these cancers. We posit that this is due to the complexity of multiple weak genetic effects within the metabolic pathway that can only be effectively detected through multi-variant analysis. We conducted a comprehensive association analysis of the estrogen metabolic pathway by interrogating 239 tagSNPs within 35 genes of the pathway in three tumor samples. The discovery sample consisted of 1,596 breast cancer cases, 719 endometrial cancer cases, and 1,730 controls from Sweden; and the validation sample included 2,245 breast cancer cases and 1,287 controls from Finland. We performed admixture maximum likelihood (AML)–based global tests to evaluate the cumulative effect from multiple SNPs within the whole metabolic pathway and three sub-pathways for androgen synthesis, androgen-to-estrogen conversion, and estrogen removal. In the discovery sample, although no single polymorphism was significant after correction for multiple testing, the pathway-based AML global test suggested association with both breast (pglobal = 0.034) and endometrial (pglobal = 0.052) cancers. Further testing revealed the association to be focused on polymorphisms within the androgen-to-estrogen conversion sub-pathway, for both breast (pglobal = 0.008) and endometrial cancer (pglobal = 0.014). The sub-pathway association was validated in the Finnish sample of breast cancer (pglobal = 0.015). Further tumor subtype analysis demonstrated that the association of the androgen-to-estrogen conversion sub-pathway was confined to postmenopausal women with sporadic estrogen receptor positive tumors (pglobal = 0.0003). Gene-based AML analysis suggested CYP19A1 and UGT2B4 to be the major players within the sub-pathway. Our study indicates that the composite genetic determinants related to the androgen–estrogen conversion are important for the induction of two hormone-associated cancers, particularly for the hormone-driven breast tumour subtypes.
Estrogen exposure is the most important risk factor for breast and endometrial cancers. Genetic variation of the genes involved in estrogen metabolism has, however, not been consistently associated with these two cancers. We posited that the genetic risk associated with the estrogen metabolic genes is likely to be carried by multiple variants and is therefore most effectively detected by multi-variant analysis. We carried out a comprehensive association analysis of the estrogen metabolic pathway by interrogating SNPs within 35 genes of the pathway in three tumor samples from Sweden and Finland. Through pathway-based multi-variant association analysis, we showed that the genetic variation within the estrogen metabolic pathway is associated with risk for breast and endometrial cancers and that the genetic variation within the genes involved in androgen-to-estrogen conversion is particularly important for the development of ER–positive and sporadic breast tumors in postmenopausal women. Our study has demonstrated that the influence of genetic variation on hormone exposure has an impact on breast cancer development, especially on the development of hormone-driven breast tumor subtypes. Our study has also highlighted that future genetic studies of the estrogen metabolic genes should focus on the androgen-to-estrogen conversion process.
Genome-wide association studies (GWAS) have identified a number of genetic variants associated with lung cancer risk. However, these loci explain only a small fraction of lung cancer hereditability and other variants with weak effect may be lost in the GWAS approach due to the stringent significance level after multiple comparison correction. In this study, in order to identify important pathways involving the lung carcinogenesis, we performed a two-stage pathway analysis in GWAS of lung cancer in Han Chinese using gene set enrichment analysis (GSEA) method. Predefined pathways by BioCarta and KEGG databases were systematically evaluated on Nanjing study (Discovery stage: 1,473 cases and 1,962 controls) and the suggestive pathways were further to be validated in Beijing study (Replication stage: 858 cases and 1,115 controls). We found that four pathways (achPathway, metPathway, At1rPathway and rac1Pathway) were consistently significant in both studies and the P values for combined dataset were 0.012, 0.010, 0.022 and 0.005 respectively. These results were stable after sensitivity analysis based on gene definition and gene overlaps between pathways. These findings may provide new insights into the etiology of lung cancer.
Objectives: Using a novel candidate SNP approach, we aimed to identify a possible genetic basis for the higher glioma incidence in Whites relative to East Asians and African-Americans. Methods: We hypothesized that genetic regions containing SNPs with extreme differences in allele frequencies across ethnicities are most likely to harbor susceptibility variants. We used International HapMap Project data to identify 3,961 candidate SNPs with the largest allele frequency differences in Whites compared to East Asians and Africans and tested these SNPs for association with glioma risk in a set of White cases and controls. Top SNPs identified in the discovery dataset were tested for association with glioma in five independent replication datasets. Results: No SNP achieved statistical significance in either the discovery or replication datasets after accounting for multiple testing or conducting meta-analysis. However, the most strongly associated SNP, rs879471, was found to be in linkage disequilibrium with a previously identified risk SNP, rs6010620, in RTEL1. We estimate rs6010620 to account for a glioma incidence rate ratio of 1.34 for Whites relative to East Asians. Conclusion: We explored genetic susceptibility to glioma using a novel candidate SNP method which may be applicable to other diseases with appropriate epidemiologic patterns.
glioma; candidate SNP association study; ancestry informative markers; admixture; race; ethnicity; brain cancer
Epidemiological and pedigree studies suggest that lung cancer results from the combined effects of age, smoking, impaired lung function and genetic factors. In a case control association study of healthy smokers and lung cancer cases, we identified genetic markers associated with either susceptibility or protection to lung cancer.
We screened 157 candidate single nucleotide polymorphisms (SNP) in a discovery cohort of 439 subjects (200 controls and 239 lung cancer cases) and identified 30 SNPs associated with either the healthy smokers (protective) or lung cancer (susceptibility) phenotype. After genotyping this 30 SNP panel in a validation cohort of 491 subjects (248 controls and 207 lung cancers) and, using the same protective and susceptibility genotypes from our discovery cohort, a 20 SNP panel was selected based on replication of SNP associations in the validation cohort. Following multivariate logistic regression analyses, including the selected SNPs from runs 1 and 2, we found age and family history of lung cancer to be significantly and independently associated with lung cancer. Numeric scores were assigned to both the SNP and demographic data, and combined to form a simple algorithm of risk.
Significant differences in the distribution of the lung cancer susceptibility score was found between normal controls and lung cancer cases, which remained after accounting for differences in lung function. Validation in other case-control and prospective cohorts are underway to further define the potential clinical utility of this model.
Background. Analysis of candidate genes in individual studies has had only limited success in identifying particular gene variants that are conclusively associated with lung cancer risk. In the International Lung Cancer Consortium (ILCCO), we conducted a coordinated genotyping study of 10 common variants selected because of their prior evidence of an association with lung cancer. These variants belonged to candidate genes from different cancer-related pathways including inflammation (IL1B), folate metabolism (MTHFR), regulatory function (AKAP9 and CAMKK1), cell adhesion (SEZL6) and apoptosis (FAS, FASL, TP53, TP53BP1 and BAT3). Methods. Genotype data from 15 ILCCO case–control studies were available for a total of 8431 lung cancer cases and 11 072 controls of European descent and Asian ethnic groups. Unconditional logistic regression was used to model the association between each variant and lung cancer risk. Results. Only the association between a non-synonymous variant of TP53BP1 (rs560191) and lung cancer risk was significant (OR = 0.91, P = 0.002). This association was more striking for squamous cell carcinoma (OR = 0.86, P = 6 × 10−4). No heterogeneity by center, ethnicity, smoking status, age group or sex was observed. In order to confirm this association, we included results for this variant from a set of independent studies (9966 cases/11 722 controls) and we reported similar results. When combining all these studies together, we reported an overall OR = 0.93 (0.89–0.97) (P = 0.001). This association was significant only for squamous cell carcinoma [OR = 0.89 (0.85–0.95), P = 1 × 10−4]. Conclusion. This study suggests that rs560191 is associated to lung cancer risk and further highlights the value of consortia in replicating or refuting published genetic associations.
To evaluate association with polycystic ovary syndrome (PCOS) of 295 variants in 39 genes central to metabolic insulin signaling and glycogen synthase kinase-3β (GSK-3β) regulation, followed by replication efforts.
Case-control association study, with discovery and replication cohorts.
Subjects were recruited from reproductive endocrinology clinics, controls were recruited from communities surrounding the University of Alabama at Birmingham and Erasmus Medical Center.
273 cases with PCOS and 173 controls in the discovery cohort; 526 cases and 3585 controls in the replication cohort. All subjects were Caucasian.
Phenotypic and genotypic assessment.
Main Outcome Measures
295 SNPs, PCOS status.
Several SNPs were associated with PCOS in the discovery cohort. Four insulin receptor (INSR) SNPs and three insulin receptor substrate 2 (IRS2) SNPs associated with PCOS (P<0.05) were genotyped in the replication cohort. One INSR SNP (rs2252673) replicated association with PCOS. The minor allele conferred increased odds of PCOS in both cohorts, independent of body mass index (BMI).
A pathway-based, tagging SNP approach allowed us to identify novel INSR SNPs associated with PCOS, one of which confirmed association in a large replication cohort.
PCOS; INSR; replication; SNP
Reduced DNA repair capacity has been proposed as a predisposing factor for melanoma. We comprehensively evaluated 1,463 genetic variants across 60 DNA repair–related pathway genes in relation to melanoma risk in a nested case-control study of 218 melanoma cases (20% on head and neck) and 218 matched controls within the Nurses' Health Study (NHS). We then genotyped the two variants with the smallest P value in two replication sets: 184 melanoma cases (28% on head and neck) and 184 matched controls in the Health Professionals Follow-Up Study (HPFS); and 183 melanoma cases (10% on head and neck) and 183 matched controls in the NHS. The SNP rs3219125 in the PARP1 gene was significantly associated with melanoma risk in the discovery set (odds ratio (OR) 3.14; 95% confidence interval (CI) 1.70–5.80) and in the HPFS replication set (OR, 1.92; 95%CI, 1.05–3.54) but not in the NHS replication set (OR, 1.07; 95%CI, 0.58–1.97). In the joint analysis, the OR was 1.89 (95%CI, 1.34–2.68) for this polymorphism, and this increased risk was more pronounced among patients with lesions in head/neck (OR, 3.19; 95% CI, 1.77–5.73 for head/neck, and OR, 1.54; 95% CI, 1.03–2.30 for other sites, P value for heterogeneity test = 0.036). Our findings suggest the possible involvement of the PARP1 variant in melanoma development, especially for sites with high sun exposure. Further work on fine-mapping and on the functional characterization of this and linked SNPs in this region is required.
DNA Repair; Melanoma; Etiology; Polymorphism
Heritable risk for breast cancer includes an increasing number of common, low effect risk variants. We conducted a multistage genetic association study in a series of independent epidemiologic breast cancer study populations to identify novel breast cancer risk variants.
We tested 1,162 SNPs of greatest nominal significance from stage I of the Cancer Genetic Markers of Susceptibility breast cancer study (CGEMS; 1,145 cases, 1,142 controls) for evidence of replicated association with breast cancer in the Nashville Breast Cohort (NBC; 599 cases, 1,161 controls), the Collaborative Breast Cancer Study (CBCS; 1,552 cases, 1,185 controls), and BioVU Breast Cancer Study (BioVU; 1,172 cases, 1,172 controls).
Among these SNPs, a series of validated breast cancer risk variants yielded expected associations in the study populations. In addition, we observed two previously unreported loci that were significantly associated with breast cancer risk in the CGEMS, NBC, and CBCS study populations and had a consistent, although not statistically significant, risk effect in the BioVU study population. These were rs1626678 at 10q25.3 near ENO4 and KIAA1598 (meta-analysis age-adjusted OR 1.13 [1.07–1.20], P = 5.6 × 10−5), and rs8046508 at 16q23.1 in the eighth intron of WWOX (meta-analysis age-adjusted OR = 1.20 [1.10–1.31], P = 3.5 × 10−5).
Our data supports the association of two novel loci, at 10q25.3 and 16q23.1, with risk of breast cancer.
The expanding compendium of known breast cancer genetic risk variants holds increasing power for clinical risk prediction models of breast cancer, improving upon the Gail model.
Clinical factors predicting pulmonary complications after lung resection have been well described, whereas the role of genetics is unknown. The vascular endothelial growth factor (VEGF) signaling pathway has been linked to acute lung injury. We hypothesized that genetic variations in this pathway may be associated with postoperative pulmonary complications after lung resection.
One hundred ninety-six single nucleotide polymorphisms (SNPs) in 17 genes in the VEGF pathway were genotyped in a discovery set of 264 patients and a replication set of 264 patients who underwent lobectomy for lung cancer. Multivariable analysis adjusting for baseline clinical factors was used to identify SNPs associated with pulmonary complications. Cumulative and classification and regression tree (CART) analyses were used to further stratify risk groups.
The overall number of pulmonary complications was 164/528 (31%). The effects of 6 SNPs were consistent in the discovery and replication sets (pooled p value < 0.05). The rs9319425 SNP in the VEGF receptor gene FLT1 resulted in a 1.50-fold increased risk (1.15–1.96; p = 0.003). A cumulative effect for the number of risk genotypes and complications was also evident (p < 0.01). Patients carrying 5 risk genotypes had a 5.76-fold increase in risk (2.73–12.16; p = 4.44 × 10−6). Regression tree analysis identified potential gene-gene interactions between FLT1:rs9319425 and RAF1:rs713178. The addition of the 6 SNPs to the clinical model increased the area under the receiver operating characteristic curve by 6.8%.
Genetic variations in the VEGF pathway are associated with risk of pulmonary complications after lobectomy. This may offer insight into the underlying biological mechanisms of pulmonary complications.
Acute Lung Injury (ALI) is a syndrome with high associated mortality characterized by severe hypoxemia and pulmonary infiltrates in patients with critical illness. We conducted the first investigation to use the genome wide association (GWA) approach to identify putative risk variants for ALI. Genome wide genotyping was performed using the Illumina Human Quad 610 BeadChip. We performed a two-stage GWA study followed by a third stage of functional characterization. In the discovery phase (Phase 1), we compared 600 European American trauma-associated ALI cases with 2266 European American population-based controls. We carried forward the top 1% of single nucleotide polymorphisms (SNPs) at p<0.01 to a replication phase (Phase 2) comprised of a nested case-control design sample of 212 trauma-associated ALI cases and 283 at-risk trauma non-ALI controls from ongoing cohort studies. SNPs that replicated at the 0.05 level in Phase 2 were subject to functional validation (Phase 3) using expression quantitative trait loci (eQTL) analyses in stimulated B-lymphoblastoid cell lines (B-LCL) in family trios. 159 SNPs from the discovery phase replicated in Phase 2, including loci with prior evidence for a role in ALI pathogenesis. Functional evaluation of these replicated SNPs revealed rs471931 on 11q13.3 to exert a cis-regulatory effect on mRNA expression in the PPFIA1 gene (p = 0.0021). PPFIA1 encodes liprin alpha, a protein involved in cell adhesion, integrin expression, and cell-matrix interactions. This study supports the feasibility of future multi-center GWA investigations of ALI risk, and identifies PPFIA1 as a potential functional candidate ALI risk gene for future research.
Chronic inflammation is an important mechanism for the development and progression of prostate cancer. To better understand the potential relationship between genes in the inflammation pathway and prostate cancer (PC) risk, we evaluated variants in 16 candidate genes.
A total of 143 tagging and amino acid altering single nucleotide polymorphisms (SNPs) were genotyped in Caucasian and African American men participating in one of two population-based, case-control studies (n = 1,458 cases and 1,351 controls). The relative risk of prostate cancer was estimated using logistic and polytomous regression models.
Ten SNPs in seven genes (CXCL12, IL4, IL6, IL6ST, PTGS2, STAT3, and TNF) were nominally associated (p<0.05) with risk of PC in Caucasians. The most significant effect on risk was seen with rs11574783 in the IL6ST gene (odds ratio, OR=0.08, 95% CI 0.01–0.63). Cumulatively, four SNPs in genes IL4, IL6ST, PTGS2, and STAT3 conferred a three-fold elevation in PC risk among men carrying the maximum number of high-risk alleles (OR=2.97, 95% CI 1.41–6.25, ptrend = 0.0003). Risk estimates for seven SNPs varied significantly according to disease aggressiveness (phomogeneity<0.05), with SNPs in AKT1, PIK3R1 and STAT3 independently associated with more aggressive PC; OR=5.1 (95% CI 2.29–11.40, ptrend = 3.8×10−5) for carriers of all high-risk genotypes.
These results suggest that variants in genes within the inflammation pathway may play a role in the development of PC, however further studies are needed to replicate our findings.
These results underline the potential importance of the inflammation pathway in PC development and progression.
prostate cancer; genetic association; inflammation; genetic variation
The high incidence of lung cancer in Xuanwei County, China has been attributed to exposure to indoor smoky coal emissions that contain polycyclic aromatic hydrocarbons. The inflammatory response induced by coal smoke components may promote lung tumor development. We studied the association between single nucleotide polymorphisms (SNP) in genes involved in innate immunity and lung cancer risk in a population-based case-control study (122 cases and 122 controls) in Xuanwei. A total of 1,360 tag SNPs in 149 gene regions were included in the analysis. FCER2 rs7249320 was the most significant SNP (OR: 0.30; 95% CI: 0.16–0.55; P, 0.0001; false discovery rate value, 0.13) for variant carriers. The gene regions ALOX12B/ALOX15B and KLK2 were associated with increased lung cancer risk globally (false discovery rate value < 0.15). In addition, there were positive interactions between KLK15 rs3745523 and smoky coal use (OR: 9.40; P interaction = 0.07), and between FCER2 rs7249320 and KLK2 rs2739476 (OR: 10.77; P interaction = 0.003). Our results suggest that genetic polymorphisms in innate immunity genes may play a role in the carcinogenesis of lung cancer caused by polycyclic aromatic hydrocarbon-containing coal smoke. Integrin/receptor and complement pathways as well as IgE regulation are particular noteworthy.
lung cancer; innate immunity; single nucleotide polymorphism; polycyclic aromatic hydrocarbon; coal; FERC2; KLK
Genetic factors play important roles in lung cancer susceptibility. In this study, we replicated the association of 5p15.33 and 6p21.33 with familial lung cancer. Taking into account the previously identified genetic susceptibility variants on 6q23-25/RGS17 and 15q24-25.1, we further determined the cumulative association of these four genetic regions and the population attributable risk percent of familial lung cancer they account for.
One hundred ninety-four case patients and 219 cancer-free control subjects from the Genetic Epidemiology of Lung Cancer Consortium were used for the association analysis. Each familial case was chosen from one high-risk lung cancer family that has three or more affected members. Single nucleotide polymorphisms (SNP) on chromosomal regions 5p15.33, 6p21.33, 6q23-25/RGS17, and 15q24-25.1 were assessed for their associations with familial lung cancer. The cumulative association of the four chromosomal regions with familial lung cancer was evaluated with the use of a linear logistic model. Population attributable risk percent was calculated for each SNP using risk ratio.
SNP rs31489 showed the strongest evidence of familial lung cancer association on 5p15.33 (P = 2 × 10−4; odds ratio, 0.57; 95% confidence interval, 0.42-0.77), whereas rs3117582 showed a weak association on 6p21.33 (P = 0.09; odds ratio, 1.47; 95% confidence interval, 0.94-2.31). Analysis of a combination of SNPs from the four regions provided a stronger cumulative association with familial lung cancer (P = 6.70 × 10−6) than any individual SNPs. The risk of lung cancer was increased to 3- to 11-fold among those subjects who had at least one copy of risk allele at each region compared with subjects without any of the risk factors. These four genetic regions contribute to a total of 34.6% of familial lung cancer in smokers.
The SNPs in four chromosomal regions have a cumulative and significant association with familial lung cancer and account for about one-third of the population attributable risk for familial lung cancer.
Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease.
While lung cancer is largely caused by tobacco smoking, inherited genetic factors play a role in its etiology. Genome-wide association studies (GWAS) in Europeans have robustly demonstrated only three polymorphic variations influencing lung cancer risk. Tumor heterogeneity may have hampered the detection of association signal when all lung cancer subtypes were analyzed together. In a GWAS of 5,355 European smoking lung cancer cases and 4,344 smoking controls, we conducted a pathway-based analysis in lung cancer histologic subtypes with 19,082 SNPs mapping to 917 genes in the HuGE-defined “inflammation” pathway. We identified a susceptibility locus for squamous cell lung carcinoma (SQ) at 12p13.33 (RAD52, rs6489769), and replicated the association in three independent samples totaling 3,359 SQ cases and 9,100 controls (odds ratio=1.20, Pcombined=2.3×10−8).
The combination of pathway-based approaches and information on disease specific subtypes can improve the identification of cancer susceptibility loci in heterogeneous diseases.
Lung cancer; histology; squamous cell carcinoma; pathway analysis; RAD52
Chromosome 5p15.33 has been identified by genome-wide association studies as one of the regions that associate with lung cancer risk. A few single-nucleotide polymorphisms (SNPs) in the telomerase reverse transcriptase (TERT) and cleft lip and palate transmembrane 1-like (CLPTM1L) genes located in this region have shown consistent associations. We performed dense genotyping of SNPs in this region to refine the previously reported association signals for lung cancer risk. Two hundred and fifteen SNPs were genotyped on an Illumina iSelect panel, in a hospital-based case–control study of 1681 lung cancer cases and 1235 unaffected controls. Association was tested using unconditional logistic regression, while adjusting for age, sex and pack-years smoked. Furthermore, since many of the SNPs were in linkage disequilibrium (LD), haplotype blocks were constructed, from which tagging SNPs at an r2 threshold of ≥0.95 were included in a stepwise forward selection logistic regression model. Of the 215 SNPs, 69 were significant at P < 0.05 in univariate analysis; of these, 35 SNPs meeting the r2 threshold were included in the multiple logistic regression model. Two SNPs, rs370348 (odds ratio = 0.76, P = 1.6 × 10−6) and rs4975538 (odds ratio = 1.18, P = 0.005), significantly associated with risk in the overall sample. Among ever smokers, rs4975615 (odds ratio = 0.75, P = 1.2 × 10−4) and rs4975538 (odds ratio = 1.26, P = 0.002) were significant, whereas among never-smokers, rs451360 (odds ratio = 0.62, P = 7.6 × 10−5) was significant. We refined the consistent association signal in this region, allowing for the considerable LD between SNPs and identified four novel SNPs that were independently and significantly associated with lung cancer risk. Results of these analyses strongly suggest effects on risk from several loci in the TERT/CLPTM1L region.
A new understanding of the genetic basis of coronary artery disease (CAD) has recently emerged from genome-wide association (GWA) studies of common single-nucleotide polymorphisms (SNPs), thus far performed mostly in European-descent populations. To identify novel susceptibility gene variants for CAD and confirm those previously identified mostly in populations of European descent, a multistage GWA study was performed in the Japanese. In the discovery phase, we first genotyped 806 cases and 1337 controls with 451 382 SNP markers and subsequently assessed 34 selected SNPs with direct genotyping (541 additional cases) and in silico comparison (964 healthy controls). In the replication phase, involving 3052 cases and 6335 controls, 12 SNPs were tested; CAD association was replicated and/or verified for 4 (of 12) SNPs from 3 loci: near BRAP and ALDH2 on 12q24 (P=1.6 × 10−34), HLA-DQB1 on 6p21 (P=4.7 × 10−7), and CDKN2A/B on 9p21 (P=6.1 × 10−16). On 12q24, we identified the strongest association signal with the strength of association substantially pronounced for a subgroup of myocardial infarction cases (P=1.4 × 10−40). On 6p21, an HLA allele, DQB1*0604, could show one of the most prominent association signals in an ∼8-Mb interval that encompasses the LTA gene, where an association with myocardial infarction had been reported in another Japanese study. CAD association was also identified at CDKN2A/B, as previously reported in different populations of European descent and Asians. Thus, three loci confirmed in the Japanese GWA study highlight the likely presence of risk alleles with two types of genetic effects – population specific and common – on susceptibility to CAD.
coronary artery disease; gene; association study; Japanese
Using a genome-wide association scan and DNA pooling, we previously identified 5 novel genetic susceptibility loci for Behçet’s disease. Herein, we establish the genetic effect within the UBAC2 gene, replicate this genetic association, and identify a functional variant within this locus.
A total of 676 Behçet’s disease patients and 1,096 controls were studied. The discovery set included 156 patients and 167 controls from Turkey, and the replication sets included 376 patients and 369 controls, and 144 patients and 560 controls, from Turkey and Italy, respectively. Genotyping of 14 SNPs within and around UBAC2 was performed using TaqMan SNP genotyping assays.
The genetic association between Behçet’s disease and UBAC2 was established, replicated and confirmed (Meta-analysis OR= 1.84, meta-analysis P= 1.69X10−7). Haplotype analysis identified both a disease risk and a protective haplotype (P= 0.00014 and 0.0075, respectively). Using conditional haplotype analysis we identified the SNP rs7999348 (A/G) within UBAC2 as the most likely SNP with a genetic effect independent of the haplotypic effect formed by the remaining associated SNPs in this locus. Indeed, we demonstrate that rs7999348 tags a functional variant associated with increased mRNA expression of a UBAC2 transcript variant in PBMCs of individuals homozygous for the Behçet’s disease-associated “G” allele. Further, our data suggest the possibility of multiple genetic effects that increase susceptibility to Behçet’s disease in the UBAC2 locus.
We established and confirmed the genetic association between UBAC2 and Behçet’s disease in three independent sets of patients and controls. We identified the minor allele in rs7999348 as a disease-risk allele that tags altered UBAC2 expression.
Given that genome wide association studies (GWAS) of psychiatric disorders have identified only a small number of convincingly associated variants, there is interest in seeking additional evidence for associated variants using tests of gene-gene interaction. Comprehensive pair-wise SNP-SNP interaction analysis is computationally intensive and the penalty for multiple testing is severe given the number of interactions possible. Aiming to minimize these statistical and computational burdens, we have explored approaches to prioritise SNPs for interaction analyses.
Primary interaction analyses were performed using the Wellcome Trust Case Control Consortium Bipolar Disorder GWAS (1868 cases, 2938 controls). Replication analyses were performed using the Genetic Association Information Network BD dataset (1001 cases, 1033 controls). SNPs were prioritized for interaction analysis that showed evidence for association that surpassed a number of nominally significant thresholds, are within genome-wide significant genes, or are within genes that are functionally related.
For no set of prioritized SNPs did we obtain evidence to support the hypothesis that the selection strategy identified pairs of variants that were enriched for true (statistical) interactions.
SNPs prioritized according to a number of criteria do not have a raised prior probability for significant interaction that is detectable in samples of this size. As is now widely accepted for single SNP analysis, we argue the use of significance levels reflecting only the number of tests performed does not offer an appropriate degree of protection against the potential for GWAS studies to generate an enormous number of false positive interactions.
GWAS; SNP; epistasis; association; interaction; gene
Renal interstitial fibrosis and glomerular sclerosis are hallmarks of diabetic nephropathy (DN) and several studies have implicated members of the WNT pathways in these pathological processes. This study comprehensively examined common genetic variation within the WNT pathway for association with DN.
Genes within the WNT pathways were selected on the basis of nominal significance and consistent direction of effect in the GENIE meta-analysis dataset. Common SNPs and common haplotypes were examined within the selected WNT pathway genes in a white population with type 1 diabetes, discordant for DN (cases: n = 718; controls: n = 749). SNPs were genotyped using Sequenom or Taqman assays. Association analyses were performed using PLINK, to compare allele and haplotype frequencies in cases and controls. Correction for multiple testing was performed by either permutation testing or using false discovery rate.
A logistic regression model including collection centre, duration of diabetes, and average HbA1c as covariates highlighted three SNPs in GSK3B (rs17810235, rs17471, rs334543), two in DAAM1 (rs1253192, rs1252906) and one in NFAT5 (rs17297207) as being significantly (P < 0.05) associated with DN, however these SNPs did not remain significant after correction for multiple testing. Logistic regression of haplotypes, with ESRD as the outcome, and pairwise interaction analyses did not yield any significant results after correction for multiple testing.
These results indicate that both common SNPs and common haplotypes of WNT pathway genes are not strongly associated with DN. However, this does not completely exclude these or the WNT pathways from association with DN, as unidentified rare genetic or copy number variants could still contribute towards the genetic architecture of DN.
Diabetic nephropathy; WNT signalling pathway; Association study; End-stage renal disease