Studies in European and East Asian populations have identified lung cancer susceptibility loci in nicotinic acetylcholine receptor (nAChR) genes on chromosome 15q25.1 which also appear to influence smoking behaviors. We sought to determine if genetic variation in nAChR genes influences lung cancer susceptibly in African-Americans, and evaluated the association of these cancer susceptibility loci with smoking behavior. A total of 1308 African-Americans with lung cancer and 1241 African-American controls from three centers were genotyped for 378 single nucleotide polymorphisms (SNPs) spanning the sixteen human nAChR genes. Associations between SNPs and the risk of lung cancer were estimated using logistic regression, adjusted for relevant covariates. Seven SNPs in three nAChR genes were significantly associated with lung cancer at a strict Bonferroni-corrected level, including a novel association on chromosome 2 near the promoter of CHRNA1 (rs3755486: OR = 1.40, 95% CI = 1.18-1.67, P = 1.0 × 10−4). Association analysis of an additional 305 imputed SNPs on 2q31.1 supported this association. Publicly available expression data demonstrated that the rs3755486 risk allele correlates with increased CHRNA1 gene expression. Additional SNP associations were observed on 15q25.1 in genes previously associated with lung cancer, including a missense variant in CHRNA5 (rs16969968: OR = 1.60, 95% CI = 1.27-2.01, P = 5.9 × 10−5). Risk alleles on 15q25.1 also correlated with an increased number of cigarettes smoked per day among the controls. These findings identify a novel lung cancer risk locus on 2q31.1 which correlates with CHRNA1 expression and replicate previous associations on 15q25.1 in African-Americans.
Lung cancer; nicotine dependence; African-Americans; genetic association; smoking
Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene–environment interactions. To determine gene–asbestos interactions in lung cancer risk, we conducted genome-wide gene–environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10–6, which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10–5). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene–asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.
Abbreviations:CIconfidence intervalEenvironmentFDRfalse discovery rateGgeneGSEAgene-set-enrichment analysisGWASgenome-wide association studiesi-GSEAimproved gene-set-enrichment analysis approachORodds ratioSNPsingle nucleotide polymorphism
Genome-wide association studies have identified variants on chromosome 15q25.1 that increase the risks of both lung cancer and nicotine dependence and associated smoking behavior. However, there remains debate as to whether the association with lung cancer is direct or is mediated by pathways related to smoking behavior. Here, the authors apply a novel method for mediation analysis, allowing for gene-environment interaction, to a lung cancer case-control study (1992–2004) conducted at Massachusetts General Hospital using 2 single nucleotide polymorphisms, rs8034191 and rs1051730, on 15q25.1. The results are validated using data from 3 other lung cancer studies. Tests for additive interaction (P = 2 × 10−10 and P = 1 × 10−9) and multiplicative interaction (P = 0.01 and P = 0.01) were significant. Pooled analyses yielded a direct-effect odds ratio of 1.26 (95% confidence interval (CI): 1.19, 1.33; P = 2 × 10−15) for rs8034191 and an indirect-effect odds ratio of 1.01 (95% CI: 1.00, 1.01; P = 0.09); the proportion of increased risk mediated by smoking was 3.2%. For rs1051730, direct- and indirect-effect odds ratios were 1.26 (95% CI: 1.19, 1.33; P = 1 × 10−15) and 1.00 (95% CI: 0.99, 1.01; P = 0.22), respectively, with a proportion mediated of 2.3%. Adjustment for measurement error in smoking behavior allowing up to 75% measurement error increased the proportions mediated to 12.5% and 9.2%, respectively. These analyses indicate that the association of the variants with lung cancer operates primarily through other pathways.
gene-environment interaction; lung neoplasms; mediation; pathway analysis; smoking
The common disease/common variant hypothesis has been popular for describing the genetic architecture of common human diseases for several years. According to the originally stated hypothesis, one or a few common genetic variants with a relatively large effect size control the risk of common diseases. A growing body of evidence, however, suggests that rare single-nucleotide polymorphisms (SNPs), i.e., those with a minor allele frequency of less than 5%, are also an important component of the genetic architecture of common human diseases. In this study, we analyzed the relevance of rare SNPs to the risk of common disease from an evolutionary perspective and found that rare SNPs are more likely than common SNPs to be functional and tend to have a stronger effect size than do common SNPs. This observation, plus the fact that most of the SNPs in the human genome are rare, suggests that rare SNPs are a crucial element of the genetic architecture of common human diseases. We propose that the next generation of genomic studies should focus on analyzing rare SNPs. Further, targeting patients with a family history of the disease, an extreme phenotype, or early disease onset may facilitate the detection of risk-associated rare SNPs.
Single Nucleotide Polymorphisms (SNPs); Genome Wide Association Studies (GWAS); Minor Allele Frequency (MAF); negative selection
We explored the contribution of nitrosamine metabolism to lung cancer in a pilot investigation of genetic variation in CYP2B6, a high-affinity enzymatic activator of tobacco-specific nitrosamines with a negligible role in nicotine metabolism. Previously we found that variation in CYP2A6 and CHRNA5-CHRNA3-CHRNB4 combined to increase lung cancer risk in a case-control study in European American ever-smokers (n = 860). However, these genes are involved in the pharmacology of both nicotine, through which they alter smoking behaviours, and carcinogenic nitrosamines. Herein, we separated participants by CYP2B6 genotype into a high- vs. low-risk group (*1/*1 + *1/*6 vs. *6/*6). Odds ratios estimated through logistic regression modeling were 1.25 (95% CI 0.68–2.30), 1.27 (95% CI 0.89–1.79) and 1.56 (95% CI 1.04–2.31) for CYP2B6, CYP2A6 and CHRNA5-CHRNA3-CHRNB4, respectively, with negligible differences when all genes were evaluated concurrently. Modeling the combined impact of high-risk genotypes yielded odds ratios that rose from 2.05 (95% CI 0.39–10.9) to 2.43 (95% CI 0.47–12.7) to 3.94 (95% CI 0.72–21.5) for those with 1, 2 and 3 vs. 0 high-risk genotypes, respectively. Findings from this pilot point to genetic variation in CYP2B6 as a lung cancer risk factor supporting a role for nitrosamine metabolic activation in the molecular mechanism of lung carcinogenesis.
CYP2B6; CYP2A6; CHRNA5-CHRNA3-CHRNB4; tobacco specific nitrosamines; lung cancer risk; genetic variation
Risk prediction models are useful in clinical decision making. We have published an internally validated prediction tool for lung cancer based on easily obtainable epidemiologic and clinical data. Because the precision of the model was modest, we now estimate the improvement obtained by adding two markers of DNA repair capacity.
Assay data (host-cell reactivation and mutagen sensitivity) were available for 725 White lung cancer cases and 615 controls, all former or current smokers, a subset of cases and controls from the previous analysis. Multivariable models were constructed from the original variables with addition of the biomarkers separately and together. Pairwise comparisons of the area under the receiver operating characteristic curves (AUC) and 3-fold cross-validations were done.
For former smokers, the AUC and 95% confidence intervals were 0.67 (0.63–0.71) for the baseline model and 0.70 (0.66–0.74) for the expanded model. For current smokers, the comparable AUC values were 0.68 (0.64–0.72) and 0.73 (0.69–0.77). For both groups, the expanded models were statistically significantly better than the baseline models (P = 0.006 and P = 0.0048, respectively), although the increases in the concordance statistics were modest. We also recomputed 1-year absolute risks of lung cancer as described previously for two different risk profiles and showed that individuals who exhibited poor repair capacity or heightened mutagen sensitivity had increased absolute risks of lung cancer.
Addition of biomarker assays improved the sensitivity of the expanded models.
Common variants in the nicotinic acetylcholine receptor gene cluster on chromosome 15q24-25.1 were associated with lung cancer risk in three recently published independently conducted genome-wide association studies, with no consensus as to the relative impact of the variants on the propensity to smoke vs a direct carcinogenic effect. To further explore our hypothesis that these variants are indeed associated with both cancer causation and nicotine dependence, we performed a more detailed analysis of the association of these putative risk genotypes with smoking phenotype, as well as in lifetime never smokers, and in other smoking-related cancers. We demonstrate a statistically significant association of the variants with both nicotine dependence, as well as lung cancer phenotypes, including earlier age at lung cancer onset. The variants were associated with higher risks of lung cancer in lower smoking-exposed strata, and in individuals with a strong family history of lung or smoking-related cancers. In contrast, we found no evidence that the variants were associated with elevated risks in 547 lifetime never-smoking lung cancer case subjects, nor in other smoking-related cancers (bladder and renal). Thus, we conclude that the variants are implicated both in smoking behavior and more directly in lung cancer risk.
Tumor size at diagnosis (TSD) indirectly reflects tumor growth rate. The relationship between TSD and smoking is poorly understood. The aim of the study was to determine the relationship between smoking and TSD. We reviewed 1712 newly diagnosed and previously untreated non-small cell lung cancer (NSCLC) patients’ electronic medical records and collected tumor characteristics. Demographic and epidemiologic characteristics were derived from questionnaires administered during personal interviews. Univariate and multivariate linear regression models were used to evaluate the relationship between TSD and smoking controlling for demographic and clinical factors. We also investigated the relationship between the rs1051730 SNP in an intron of the CHRNA3 gene (the polymorphism most significantly associated with lung cancer risk and smoking behavior) and TSD. We found a strong dose dependent relationship between TSD and smoking. Current smokers had largest and never smokers smallest TSD with former smokers having intermediate TSD. In the multivariate linear regression model, smoking status (never, former, and current), histological type (adenocarcinoma vs SqCC), and gender were significant predictors of TSD. Smoking duration and intensity may explain the gender effect in predicting TSD. We found that the variant allele of rs1051730 in CHRNA3 gene was associated with larger TSD of squamous cell carcinoma. In the multivariate linear regression model, both rs1051730 and smoking were significant predictors for the size of squamous carcinomas. We conclude that smoking is positively associated with lung tumor size at the moment of diagnosis.
Lung cancer; tumor size; epidemiologic characteristics; risk factors; CHRNA3
Genetic variants located at 15q25, including those in the cholinergic receptor nicotinic cluster (CHRNA5) have been implicated in both lung cancer risk and nicotine dependence in recent genome-wide association studies. Among these variants, a 22 base pair insertion/deletion, rs3841324 showed the strongest association with CHRNA5 mRNA expression levels. However the influence of rs3841324 on lung cancer risk has not been studied in depth.
We have therefore evaluated the association of rs3841324 genotypes with lung cancer risk in a case-control study of 624 Caucasian subjects with lung cancer and 766 age- and sex-matched cancer-free Caucasian controls. We also evaluated the joint effects of rs3841324 with single-nucleotide polymorphisms (SNPs) rs16969968 and rs8034191 in the 15q25 region that have been consistently implicated in lung cancer risk.
We found that the homozygous genotype with both short alleles (SS) of rs3841324 was associated with a decreased lung cancer risk in female ever smokers relative to the homozygous wild-type (LL) and heterozygous (LS) genotypes combined in a recessive model (OR adjusted = 0.55, 95% CI = 0.31–0.89, P = 0.0168). There was no evidence for a sex difference in the association between this variant and cigarettes smoked per day (CPD). Diplotype analysis of rs3841324 with either rs16969968 or rs8034191 showed that these polymorphisms influenced the lung cancer risk independently.
Conclusions and impact
This study has shown a sex difference in the association between the 15q25 variant rs3841324 and lung cancers. Further research is warranted to elucidate the mechanisms underlying these observations.
lung cancer; CHRNA5; Chromosome 15q25; rs3841324; sex-specific association
Tobacco-induced lung cancer is characterized by a deregulated inflammatory microenvironment. Variants in multiple genes in inflammation pathways may contribute to risk of lung cancer.
We therefore conducted a three-stage comprehensive pathway analysis (discovery, replication and meta-analysis) of inflammation gene variants in ever smoking lung cancer cases and controls. A discovery set (1096 cases; 727 controls) and an independent and non-overlapping internal replication set (1154 cases; 1137 controls) were derived from an ongoing case-control study. For discovery, we used an iSelect BeadChip to interrogate a comprehensive panel of 11737 inflammation pathway SNPs and selected nominally significant (p<0.05) SNPs for internal replication.
There were 6 SNPs that achieved statistical significance (p<0.05) in the internal replication dataset with concordant risk estimates for former smokers and 5 concordant and replicated SNPs in current smokers. Replicated hits were further tested in a subsequent meta-analysis using external data derived from two published GWAS and a case-control study. Two of these variants (a BCL2L14 SNP in former smokers and a SNP in IL2RB in current smokers) were further validated. In risk score analyses, there was a 26% increase in risk with each additional adverse allele when we combined the genotyped SNP and the most significant imputed SNP in IL2RB in current smokers and a 36% similar increase in risk for former smokers associated with genotyped and imputed BCL2L14 SNPs.
Before they can be applied for risk prediction efforts, these SNPs should be subject to further external replication and more extensive fine mapping studies.
Inflammation SNPS; lung cancer; smokers
Platinum-based regimens are the standard chemotherapy for patients with advanced non–small-cell lung cancer (NSCLC). DNA repair capacity (DRC) in tumor cells plays an important role in resistance to platinum-based drugs. We have previously reported that efficient DRC, as assessed by an in vitro lymphocyte-based assay, was a determinant of poor survival in patients with NSCLC in a relatively small data set. In this larger independent study of 591 patients with NSCLC, we further evaluated whether DRC in peripheral lymphocytes predicts survival of patients with NSCLC who receive platinum-based chemotherapy.
Patients and Methods
All patients were recruited at The University of Texas MD Anderson Cancer Center and donated blood samples before the start of any chemotherapy. We measured DRC in cultured T lymphocytes by using the host-cell reactivation assay, and we assessed associations between DRC in peripheral lymphocytes and survival of patients with NSCLC who were treated with first-line platinum-based chemotherapy.
We found an inverse association between DRC in peripheral lymphocytes and patient survival. Compared with patients in the low tertile of DRC, patients with NSCLC in the high tertile of DRC had significantly worse overall and 3-year survival (adjusted hazard ratio [HR], 1.33; 95% CI, 1.04 to 1.71; P = .023; and HR, 1.35; 95% CI, 1.04 to 1.76; P = .025, respectively). This trend was more pronounced in patients with early-stage tumors, adenocarcinoma, or squamous cell carcinoma.
We confirmed that DRC in peripheral lymphocytes is an independent predictor of survival for patients with NSCLC treated with platinum-based chemotherapy.
A mediation model explores the direct and indirect effects between an independent variable and a dependent variable by including other variables (or mediators). Mediation analysis has recently been used to dissect the direct and indirect effects of genetic variants on complex diseases using case-control studies. However, bias could arise in the estimations of the genetic variant-mediator association because the presence or absence of the mediator in the study samples is not sampled following the principles of case-control study design. In this case, the mediation analysis using data from case-control studies might lead to biased estimates of coefficients and indirect effects. In this article, we investigated a multiple-mediation model involving a three-path mediating effect through two mediators using case-control study data. We propose an approach to correct bias in coefficients and provide accurate estimates of the specific indirect effects. Our approach can also be used when the original case-control study is frequency matched on one of the mediators. We employed bootstrapping to assess the significance of indirect effects. We conducted simulation studies to investigate the performance of the proposed approach, and showed that it provides more accurate estimates of the indirect effects as well as the percent mediated than standard regressions. We then applied this approach to study the mediating effects of both smoking and chronic obstructive pulmonary disease (COPD) on the association between the CHRNA5-A3 gene locus and lung cancer risk using data from a lung cancer case-control study. The results showed that the genetic variant influences lung cancer risk indirectly through all three different pathways. The percent of genetic association mediated was 18.3% through smoking alone, 30.2% through COPD alone, and 20.6% through the path including both smoking and COPD, and the total genetic variant-lung cancer association explained by the two mediators was 69.1%.
Chromosome 5p15.33 has been identified by genome-wide association studies as one of the regions that associate with lung cancer risk. A few single-nucleotide polymorphisms (SNPs) in the telomerase reverse transcriptase (TERT) and cleft lip and palate transmembrane 1-like (CLPTM1L) genes located in this region have shown consistent associations. We performed dense genotyping of SNPs in this region to refine the previously reported association signals for lung cancer risk. Two hundred and fifteen SNPs were genotyped on an Illumina iSelect panel, in a hospital-based case–control study of 1681 lung cancer cases and 1235 unaffected controls. Association was tested using unconditional logistic regression, while adjusting for age, sex and pack-years smoked. Furthermore, since many of the SNPs were in linkage disequilibrium (LD), haplotype blocks were constructed, from which tagging SNPs at an r2 threshold of ≥0.95 were included in a stepwise forward selection logistic regression model. Of the 215 SNPs, 69 were significant at P < 0.05 in univariate analysis; of these, 35 SNPs meeting the r2 threshold were included in the multiple logistic regression model. Two SNPs, rs370348 (odds ratio = 0.76, P = 1.6 × 10−6) and rs4975538 (odds ratio = 1.18, P = 0.005), significantly associated with risk in the overall sample. Among ever smokers, rs4975615 (odds ratio = 0.75, P = 1.2 × 10−4) and rs4975538 (odds ratio = 1.26, P = 0.002) were significant, whereas among never-smokers, rs451360 (odds ratio = 0.62, P = 7.6 × 10−5) was significant. We refined the consistent association signal in this region, allowing for the considerable LD between SNPs and identified four novel SNPs that were independently and significantly associated with lung cancer risk. Results of these analyses strongly suggest effects on risk from several loci in the TERT/CLPTM1L region.
Genetic variations in the CYP2A6 nicotine metabolic gene and the CHRNA5-CHRNA3-CHRNB4 (CHRNA5-A3-B4) nicotinic gene cluster have been independently associated with lung cancer. With genotype data from ever-smokers of European ancestry (417 lung cancer patients and 443 control subjects), we investigated the relative and combined associations of polymorphisms in these two genes with smoking behavior and lung cancer risk. Kruskal–Wallis tests were used to compare smoking variables among the different genotype groups, and odds ratios (ORs) for cancer risk were estimated using logistic regression analysis. All statistical tests were two-sided. Cigarette consumption (P < .001) and nicotine dependence (P = .036) were the highest in the combined CYP2A6 normal metabolizers and CHRNA5-A3-B4 AA (tag single-nucleotide polymorphism rs1051730 G>A) risk group. The combined risk group also exhibited the greatest lung cancer risk (OR = 2.03; 95% confidence interval [CI] = 1.21 to 3.40), which was even higher among those who smoked 20 or fewer cigarettes per day (OR = 3.03; 95% CI = 1.38 to 6.66). Variation in CYP2A6 and CHRNA5-A3-B4 was independently and additively associated with increased cigarette consumption, nicotine dependence, and lung cancer risk. CYP2A6 and CHRNA5-A3-B4 appear to be more strongly associated with smoking behaviors and lung cancer risk, respectively.
While lung cancer is largely caused by tobacco smoking, inherited genetic factors play a role in its etiology. Genome-wide association studies (GWAS) in Europeans have robustly demonstrated only three polymorphic variations influencing lung cancer risk. Tumor heterogeneity may have hampered the detection of association signal when all lung cancer subtypes were analyzed together. In a GWAS of 5,355 European smoking lung cancer cases and 4,344 smoking controls, we conducted a pathway-based analysis in lung cancer histologic subtypes with 19,082 SNPs mapping to 917 genes in the HuGE-defined “inflammation” pathway. We identified a susceptibility locus for squamous cell lung carcinoma (SQ) at 12p13.33 (RAD52, rs6489769), and replicated the association in three independent samples totaling 3,359 SQ cases and 9,100 controls (odds ratio=1.20, Pcombined=2.3×10−8).
The combination of pathway-based approaches and information on disease specific subtypes can improve the identification of cancer susceptibility loci in heterogeneous diseases.
Lung cancer; histology; squamous cell carcinoma; pathway analysis; RAD52
Interindividual variation in genetic background may influence the response to chemotherapy and overall survival for patients with advanced-stage non–small cell lung cancer (NSCLC).
To identify genetic variants associated with poor overall survival in these patients, we conducted a genome-wide scan of 307 260 single-nucleotide polymorphisms (SNPs) in 327 advanced-stage NSCLC patients who received platinum-based chemotherapy with or without radiation at the University of Texas MD Anderson Cancer Center (the discovery population). A fast-track replication was performed for 315 patients from the Mayo Clinic followed by a second validation at the University of Pittsburgh in 420 patients enrolled in the Spanish Lung Cancer Group PLATAX clinical trial. A pooled analysis combining the Mayo Clinic and PLATAX populations or all three populations was also used to validate the results. We assessed the association of each SNP with overall survival by multivariable Cox proportional hazard regression analysis. All statistical tests were two-sided.
SNP rs1878022 in the chemokine-like receptor 1 (CMKLR1) was statistically significantly associated with poor overall survival in the MD Anderson discovery population (hazard ratio [HR] of death = 1.59, 95% confidence interval [CI] = 1.32 to 1.92, P = 1.42 × 10−6), in the PLATAX clinical trial (HR of death = 1.23, 95% CI = 1.00 to 1.51, P = .05), in the pooled Mayo Clinic and PLATAX validation (HR of death = 1.22, 95% CI = 1.06 to 1.40, P = .005), and in pooled analysis of all three populations (HR of death = 1.33, 95% CI = 1.19 to 1.48, P = 5.13 × 10−7). Carrying a variant genotype of rs10937823 was associated with decreased overall survival (HR of death = 1.82, 95% CI = 1.42 to 2.33, P = 1.73 × 10−6) in the pooled MD Anderson and Mayo Clinic populations but not in the PLATAX trial patient population (HR of death = 0.96, 95% CI = 0.69 to 1.35).
These results have the potential to contribute to the future development of personalized chemotherapy treatments for individual NSCLC patients.
DNA repair genes are important for maintaining genomic stability and limiting carcinogenesis. We analyzed all single nucleotide polymorphisms (SNPs) of 125 DNA repair genes covered by the Illumina HumanHap300 (v1.1) BeadChips in a previously conducted genome-wide association study (GWAS) of 1,154 lung cancer cases and 1,137 controls and replicated the top-hits of XRCC4 SNPs in an independent set of 597 cases and 611 controls in Texas populations. We found that six of 20 XRCC4 SNPs were associated with a decreased risk of lung cancer with a P value of 0.01 or lower in the discovery dataset, of which the most significant SNP was rs10040363 (P for allelic test = 4.89 ×10−4). Moreover, the data in this region allowed us to impute a potentially functional SNP rs2075685 (imputed P for allelic test = 1.3 ×10−3). A luciferase reporter assay demonstrated that the rs2075685G>T change in the XRCC4 promoter increased expression of the gene. In the replication study of rs10040363, rs1478486, rs9293329, and rs2075685, however, only rs10040363 achieved a borderline association with a decreased risk of lung cancer in a dominant model (adjusted OR = 0.80, 95% CI = 0.62–1.03, P = 0.079). In the final combined analysis of both the Texas GWAS discovery and replication datasets, the strength of the association was increased for rs10040363 (adjusted OR = 0.77, 95% CI = 0.66–0.89, Pdominant = 5×10−4 and P for trend = 5×10−4) and rs1478486 (adjusted OR = 0.82, 95% CI = 0.71 −0.94, Pdominant = 6×10−3 and P for trend = 3.5×10−3). Finally, we conducted a meta-analysis of these XRCC4 SNPs with available data from published GWA studies of lung cancer with a total of 12,312 cases and 47,921 controls, in which none of these XRCC4 SNPs was associated with lung cancer risk. It appeared that rs2075685, although associated with increased expression of a reporter gene and lung cancer risk in the Texas populations, did not have an effect on lung cancer risk in other populations. This study underscores the importance of replication using published data in larger populations.
XRCC4; variant; Genetic susceptibility; genome-wide association study; replication study
Lung cancer continues to be the leading cause of cancer death in the USA and the best example of a cancer with undisputed evidence of environmental risk. However, a genetic contribution to lung cancer has also been demonstrated by studies of familial aggregation, family-based linkage, candidate gene studies and most recently genome-wide association studies (GWAS). The African-American population has been underrepresented in these genetic studies and has patterns of cigarette use and linkage disequilibrium that differ from patterns in other populations. Therefore, studies in African-Americans can provide complementary data to localize lung cancer susceptibility genes and explore smoking dependence-related genes. We used admixture mapping to further characterize genetic risk of lung cancer in a series of 837 African-American lung cancer cases and 975 African-American controls genotyped at 1344 ancestry informative single-nucleotide polymorphisms. Both case-only and case–control analyses were conducted using ADMIXMAP adjusted for age, sex, pack-years of smoking, family history of lung cancer, history of emphysema and study site. In case-only analyses, excess European ancestry was observed over a wide region on chromosome 1 with the largest excess seen at rs6587361 for non-small-cell lung cancer (NSCLC) (Z-score = −4.33; P = 1.5 × 10−5) and for women with NSCLC (Z-score = −4.82; P = 1.4 × 10−6). Excess African ancestry was also observed on chromosome 3q with a peak Z-score of 3.33 (P = 0.0009) at rs181696 among ever smokers with NSCLC. These results add to the findings from the GWAS in Caucasian populations and suggest novel regions of interest.
Pathway analysis has been proposed as a complement to single SNP analyses in GWAS. This study compared pathway analysis methods using two lung cancer GWAS data sets based on four studies: one a combined data set from Central Europe and Toronto (CETO); the other a combined data set from Germany and MD Anderson (GRMD). We searched the literature for pathway analysis methods that were widely used, representative of other methods, and had available software for performing analysis. We selected the programs EASE, which uses a modified Fishers Exact calculation to test for pathway associations, GenGen (a version of Gene Set Enrichment Analysis (GSEA)), which uses a Kolmogorov-Smirnov-like running sum statistic as the test statistic, and SLAT, which uses a p-value combination approach. We also included a modified version of the SUMSTAT method (mSUMSTAT), which tests for association by averaging χ2 statistics from genotype association tests. There were nearly 18000 genes available for analysis, following mapping of more than 300,000 SNPs from each data set. These were mapped to 421 GO level 4 gene sets for pathway analysis. Among the methods designed to be robust to biases related to gene size and pathway SNP correlation (GenGen, mSUMSTAT and SLAT), the mSUMSTAT approach identified the most significant pathways (8 in CETO and 1 in GRMD). This included a highly plausible association for the acetylcholine receptor activity pathway in both CETO (FDR≤0.001) and GRMD (FDR = 0.009), although two strong association signals at a single gene cluster (CHRNA3-CHRNA5-CHRNB4) drive this result, complicating its interpretation. Few other replicated associations were found using any of these methods. Difficulty in replicating associations hindered our comparison, but results suggest mSUMSTAT has advantages over the other approaches, and may be a useful pathway analysis tool to use alongside other methods such as the commonly used GSEA (GenGen) approach.
Mutagen sensitivity, a measurement of chromatid breaks induced by various mutagens in short-term cultures of peripheral blood lymphocytes, is an established risk factor for a number of cancers and is highly heritable. The purpose of this study is to identify genetic predictors of mutagen sensitivity. Therefore, we conducted a multi-stage genome-wide association study. The primary scan analyzed 539 437 autosomal SNPs in 673 healthy individuals, followed by validations in two independent sets of 575 and 259 healthy individuals, respectively. One SNP, rs8093763, on chromosome 18q21 showed significant association with bleomycin (BLM) sensitivity (combined P = 2.64 × 10−8). We observed significantly lower BLM-induced chromotid breaks for genotypes containing wild-type allele compared with the homozygous variant genotype in the discovery set (0.71 versus 0.90, P= 3.77 × 10−5) and in replication phase 1 (0.61 versus 0.84, P= 7.00 × 10−5). The result of replication phase 2 was not statistically significant (0.65 versus 0.68, P= 0.44). This SNP is approximately 64 kb from PMAIP1/Noxa, which is a radiation-inducible gene and exhibits higher expression in BLM-sensitive lymphoblastoid cell lines than insensitive cell lines upon BLM treatment. In conclusion, we identified a biologically plausible genetic variant on 18q21 near the PMAIP1/Noxa gene that is associated with BLM sensitivity.
Genome-wide association studies of white persons with lung cancer have identified a region of extensive linkage disequilibrium on chromosome 15q25.1 that appears to be associated with both risk for lung cancer and smoking dependence. Because studying African American persons, who exhibit lower levels of linkage disequilibrium in this region, may identify additional loci that are associated with lung cancer, we genotyped 34 single-nucleotide polymorphisms (SNPs) in this region (including LOC123688, PSMA4, CHRNA5, CHRNA3, and CHRNB4 genes) in 467 African American patients with lung cancer and 388 frequency-matched African American control subjects. Associations of SNPs in LOC123688 (rs10519203; odds ratio [OR] = 1.60, 95% confidence interval [CI] = 1.25 to 2.05, P = .00016), CHRNA5 (rs2036527; OR = 1.67, 95% CI = 1.26 to 2.21, P = .00031), and CHRNA3 (rs1051730; OR = 1.81, 95% CI = 1.26 to 2.59, P = .00137) genes with lung cancer risk reached Bonferroni-corrected levels of statistical significance (all statistical tests were two-sided). Joint logistic regression analysis showed that rs684513 (OR = 0.47, 95% CI = 0.31 to 0.71, P = .0003) in CHRNA5 and rs8034191 (OR = 1.76, 95% CI = 1.23 to 2.52, P = .002) in LOC123688 were also associated with risk. The functional A variant of rs1696698 in CHRNA5 had the strongest association with lung cancer (OR = 1.98, 95% CI = 1.25 to 3.11, P = .003). These SNPs were primarily associated with increased risk for lung adenocarcinoma histology and were only weakly associated with smoking phenotypes. Thus, among African American persons, multiple loci in the region of chromosome 15q25.1 appear to be strongly associated with lung cancer risk.
Recent genome-wide association (GWA) studies of lung cancer have shown that the CHRNA5-A3 region on chromosome 15q24-25.1 is strongly associated with an increased risk of lung cancer and nicotine dependence, and thought to be associated with chronic obstructive airways disease as well. However, it has not been established whether the association between genetic variants and lung cancer risk is a direct one or one mediated by nicotine dependence.
In this paper we applied a rigorous statistical approach, mediation analysis, to examine the mediating effect of smoking behavior and self-reported physician-diagnosed emphysema (chronic obstructive pulmonary disease [COPD]) on the relationship between the CHRNA5-A3 region genetic variant rs1051730 and the risk of lung cancer.
Our results showed that rs1051730 is directly associated with lung cancer risk, but that it is also associated with lung cancer risk through its effect on both smoking behavior and COPD. Furthermore, we showed that COPD is a mediating phenotype that explains part of the effect of smoking behavior on lung cancer. Our results also suggested that smoking behavior is a mediator of the relationship between rs1051730 and COPD risk.
Smoking behavior and COPD are mediators of the association between the SNP rs1051730 and the risk of lung cancer. Also, COPD is a mediator of the association between smoking behavior and lung cancer. Finally, smoking behavior also has mediating effects on the association between the SNP and COPD.
Lung Cancer; COPD; Mediation analysis; smoking behavior; genetic variants
Although cigarette smoking and alcohol use are known risk factors for squamous cell carcinoma of head and neck (SCCHN), only a few exposed individuals develop this disease, suggesting an individual susceptibility. In this study, we investigated the associations between genetically determined DNA repair capacity (DRC) for removing tobacco-induced DNA adducts and risk of SCCHN and tumor characteristics.
We measured DRC in cultured T-lymphocytes using the host-cell reactivation assay in a hospital-based case-control study of 744 SCCHN patients and 753 age-, sex-and ethnicity-matched cancer-free controls recruited from The University of Texas M. D. Anderson Cancer Center.
Patients with SCCHN had significantly lower mean DRC (8.84% ± 2.68%) than controls (9.97% ± 2.61%) (P < 0.0001), and the difference accounted for approximately 2-fold increased risk of SCCHN (adjusted odds ratio [OR], 1.91; 95% CI, 1.52–2.40) after adjustment for other covariates. Compared with the highest DRC quartile of controls, this increased risk was dose-dependent (second highest quartile: OR, 1.40; 95% CI, 0.99–1.98; third quartile: OR, 1.87; 95% CI, 1.34–2.62; and fourth quartile: OR, 2.76; 95% CI, 1.98–3.84, respectively; Ptrend < 0.0001). We also assessed the performance of DRC in risk prediction models by calculating the area of under the receiver operating characteristic curve. The addition of DRC to the model significantly improved the sensitivity of the expanded model. However, we did not find the association between DRC and tumor sites and stages.
DRC is an independent susceptibility biomarker for SCCHN risk but not a tumor marker.
nucleotide excision repair; genetic susceptibility; head and neck neoplasm; molecular epidemiology
To explore the impact of common variation on the risk of developing lung cancer we conducted a two-phase genome-wide association (GWA) study. In Phase 1, we compared the genotypes of 511,919 tagging single nucleotide polymorphisms (tagSNPs) in 1,952 cases and 1,438 controls; in Phase 2, 30,568 SNPs were genotyped in 2,465 cases and 3,005 controls. SNP selection was based on best supported P-values from Phase 1 and two other GWA studies of lung cancer. In the combined analysis of Phases 1 and 2, the strongest associations identified were defined by SNPs mapping to 15q25.1 (rs12914385; P = 3.19 × 10−16), 5p15.33 (rs4975616; P = 6.66 × 10−7), and 6p21.33 (rs3117582; P = 9.13 × 10−7). Variation at 15q25.1, but not 5p15.33 or 6p21.33, was strongly associated with smoking behaviour with risk alleles correlated to higher consumption. Variation at 5p15.33 was shown to significantly influence induction of lung cancer histology. Pooling data from the four series provided 21,620 genotypes for 7,560 cases and 8,205 controls. A meta-analysis provided increased support that variation at 15q25.1 (rs8034191; P = 3.24 × 10−26), 5p15.33 (rs4975616; P = 2.99 × 10−9), and 6p21.33 (rs3117582; P = 4.46 × 10−10) influences lung cancer risk. The next best-supported associations were attained at 15q15.2 (rs748404: P = 1.08 × 10−6) and 10q23.31 (rs1926203; P = 1.28 × 10−6). These data indicate few common variants account for 1% of the excess familial risk underscoring the necessity of having additional large sample series for gene discovery.
lung cancer; genome-wide association